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Abstract 


\)  A  user  microprogramming  capability  was  implemented  on  AFIT's  HP 

1 

21MX  RTE-III  computer  system.  The  AFIT  computer  will  be  used  to  sup¬ 
port  student  research  in  microprogramming  projects  involving  real  time 
digital  signal  processing  and  special  time  critical  programs  for  mili¬ 
tary  environments. 

This  study  further  looks  at  a  specific  microprogramming  technique 
to  tailor  application  programs  for  improving  a  programs  execution  time. 

A  feasibility  study  to  analyze  program  activity  for  microprogram  improve¬ 
ments  is  done  for  the  fast  Fourier  transform.  The  Bit  Reversal  sorting 
routine  is  microcoded  to  demonstrate  the  technique.  Response  of  the  FFT 
programs  is  analyzed  using  an  activity  profile  generator  program,  and  the 
difference  in  execution  speed  of  the  programs  with  and  without  the  micro¬ 
programmed  bit  reversal  routine  is  measured  and  compared. 


MICROPROGRAMMING:  A  TOOL  TO  IMPROVE  PROGRAM  PERFORMANCE 


I .  Introduc  Lion 

The  topic  is  motivated  primarily  by  the  desire  to  promote  use  of 
a  powerful  tool  for  tailoring  application  programs  such  as  those  em¬ 
ployed  in  digital  signal  processing,  post-data  reduction  or  computer 
graphics  to  meet  special  requirements.  For  example,  real  time  execu¬ 
tion,  accuracy  or  special  wcrdlength  requirements  are  often  required 
when  interfacing  hardware  equipment  to  the  computer.  This  study  how¬ 
ever  deals  primarily  with  using  microprogramming  to  improve  a  programs 
execution  time,  whether  a  real  time  application  or  other.  This  chap¬ 
ter  presents  the  objectives  and  background  requirements,  problem  and 
scope  for  undertaking  this  investigation. 

Background 

A  significant  number  of  laboratory  facilities  at  Wright-Fatterson 
AFB,  including  the  Air  Force  Institute  of  Technology  (AFIT)  use  UP- 
2 If IX  computer  systems  for  a  variety  of  post-data  processing  analysis 
and  real  time  data  acquisition,  digital  signal  and  control  tasks.  The 
Electronic  Warfare  Analysis  Facility  (EWAF),  managed  by  the  Electronic 
Warfare  Division  ASD/ENAD,  is  one  such  facility  which  uses  a  21MX  (2112) 
computer  system  to  support  a  wide  range  of  Sl’O  related  electronic  war¬ 
fare  (EW)  tests,  evaluation  and  analysis  tasks.  Present  EWAF  applica¬ 
tions  include  in-aouse  and  contracted  tasks  for  the  preparation,  acqui¬ 
sition,  p  :v  .  a  tat ;  on  and  pcst-oro.  <.  :  . :  a  ;  m'  EW  s  i  ,.;.i  1  .iV  i  >.i  l  •  i  data  and 
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antenna  measurement  data.  Future  applications  include  real  tine  digi¬ 
tal  signal  waveform  processing  tasks  and  program  simulations  for  EW 
analog  models  of  various  threat  receivers  for  analyzing  response  to 
jamming  waveforms. 

One  unique  feature  the  21MX  system  supports  is  a  user  dynamic 
microprogramming  capability  (Ref.  1:3-15).  This  feature  gives  a  user 
the  flexibility  to  augment,  modify  or  completely  replace  the  computers 
basic  machine  instruction  set  or  to  program  special  purpose  algorithms 
to  meet  specific  application  needs.  For  example  in  real  time  digital 
signal  processing  work,  throughput  and  execution  speed  are  generally 
of  prime  importance.  By  using  appropriate  microprogramming  techniques, 
execution  speeds  can  be  significantly  improved.  In  other  applications, 
such  as  data  reduction  tasks,  special  instructions  could  be  microcoded 
to  improve  accuracy  beyond  that  which  the  base  computer  can  provide 
and  at  the  same  time  improve  throughput  performance. 

Unfortunately  microprogramming  is  somewhat  specialized  and  complex. 
The  technique  is  still  viewed  .is  a  difficult  science  by  many  computer 
users  who  have  not  been  sufficiently  exposed  to  microprogramming  con¬ 
cepts  to  understand  and  put  to  use  its'  many  benefits  (Ref.  2,  Ref. 
3:210).  And  most  end  users  do  not  realize  its  ability  to  speed  up  exe¬ 
cution  of  time-critical  routines. 

Microprogramming  concepts  were  first  introduced  over  30  years  ago 
by  M.  V.  Wilkes  (Ref.  ',)  but  upp  i  teat ions  ace  sc  Li  l  in  an  early  stage 
of  development  (Ref.  1:3-19).  Wilkes'  application  of  microprogramming 
delt  primarily  with  hardware  for  designing  the  control  processor  unit 
(CPU)  (Rof.  1:1).  Todav  however,  with  the  ndvant  of  user  nil  ropmqr.an- 
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nuble  computers,  fast  semi-conductor  memories,  and  cheap  microprogram- 
mable  bit-slice  microcomputer  chips  (Ref.  5:98),  interest  in  using 
microprogramming  techniques  in  non-cpu  applications  is  rising  rapidly 
(Ref.  6:33).  The  availability  of  the  HP-21MX  user  microprogrammable 
computer  makes  it  practical  to  use  and  explore  microprogramming  tech¬ 
niques  for  a  wide  range  of  applications. 

Microprogramming  in  general  is  more  difficult  and  time  consuming 
then  writing  programs  in  FORTRAN  or  ASSEMBLY  languages  and  requires 
the  programmer  to  know  more  about  the  specific  internal  timing  and 
architecture  of  the  computer  (Ref.  7:33).  The  HP  microprogramming 
language  however  is  very  similar  to  writing  programs  in  ASSEMBLY  lan¬ 
guage  and  is  only  slightly  more  difficult  to  learn.  A  relatively 
simple  nnumonic  language  is  used  by  HP  and  the  resulting  code  assem¬ 
bled  using  a  microassembler.  Microprogram  instructions  are  stored  in 
a  special  high  speed  random  access  memory  called  a  Writable  Control 
Store  (WCS). 

Because  microprogramming  is  more  specialized,  complex  and  time 
consuming,  HP  end  users  may  shy  away  from  developing  specific  applica¬ 
tions  which  could  be  of  benefit.  AFIT  could  provide  a  significant 

I 

focal  point  for  HP  users  at  Wright-Patterson  AFB  and  elsewhere  by  ex¬ 
perimentation  through  AFIT  research  projects  involving  microprogramming 
applications.  For  example  reaL  time  input-output  problems,  digital 
s  ignal  processing  application.,  .ir.d  special  ir.cu  tuning  ol  user  programs 
could  be  studied  to  enhance  execution  speed,  accuracy,  and  word  length 
considerations.  The  benefits  include  extending  the  useful  life  of  HP 
equipment  at  the  various  labs  at  VP AFB,  developing  more  usage  and  better 
understanding  for  a  very  powerful  computer  application  tool,  and  stude.nt 
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training  in  microprogramming  techniques  and  computer  control  designs. 
Additionally  firmware  concepts  being  used  in  Air  Force  avionics  systems 
could  be  effectively  demonstrated  to  students  resulting  in  a  greater 
awareness  and  appreciation  for  the  technology. 

Problem 

The  problem  this  investigation  addresses  is  implementing  a  system 
to  support  dynamic  user  microprogramming.  Two  basic  resources  are  re¬ 
quired  to  undertake  this  study.  The  first  is  a  user  microprogrammable 
computer.  The  second  is  a  reasonable  operating  system  which  supports 
microprogramming.  The  first  requirement  is  met  with  Hewlett-Packard's 
21MX  computer.  The  second  requirement  is  met  with  Hewlett-Packard's 
Real-Time  Executive  (RTE)  operating  system.  Neither  the  RTE  operating 
system  or  the  microprogramming  capability  has  been  implemented  on  AFIT's 
HP  21  (2108)  computer  equipment.  In  addition,  this  study  investigates 
a  specific  non-cpu  application  of  microprogramming  relating  to  EW  simu¬ 
lation  and  data  processing  programs.  The  conceptual  use  of  HP  micro¬ 
programming  to  tailor  digital  signal  processing  and  post  data  processing 
application  programs  for  improving  execution  response  is  studied.  The 
concept  of  using  microprograms  to  tailor  the  machine  or  application  pro¬ 
grams  to  meet,  specific  npp.1  ications  is  not  new  (Ref.  7).  Part  of  this 
latter  problem  consist  of  identifying  appropriate  means  and  general  cri¬ 
teria  to  be  applied  for  selection  of  application  programs  which  show 
promise  of  being  improved  by  microprogramming  techniques. 

Scope 

The  development  of  the  microprogramming  capability  making  up  this 


study  includes  the  following: 


1.  The  installation  and  checkout  of  a  256  word  writable 
control  store  (WCiJ)  memory  module. 

2.  RTE-Iil  system  generation  to  include  microprogramming. 

3.  Implementation  and  checkout  of  various  HP  software 
support  tools  necessary  for  microprogram  development 
in  an  RTE  operating  environment. 

4.  An  RTE-III  backup  generation  on  disk. 

5.  Identification  of  HP  decumentation  and  survey  of  micro¬ 
programming  differences  between  HP  2100  and  HP  21MX 
computers  (M,  E,  F  series). 

6.  A  base  survey  of  HP  users. 

7.  A  literature  search. 

8.  Activity  analysis  of  FFT. 

9.  Development  of  a  microcoded  bit  reversal  sorting 
routine. 

Approach 

A  limited  survey  study  was  made  of  the  available  signal  process¬ 
ing,  post  data  processing  and  related  EW  evaluation  programs  being 
supported  now  or  planned  in  the  future  by  the  EVAF  facility.  From 
this  survey  several  programs  were  chosen  and  examined  more  closely  to 
determine  where  and  how  microprogramming  could  be  used  to  an  advantage 
to  improve  and  tune  execution  performance.  An  activity  profile  gen¬ 
erator  routine  was  used  to  monitor  the  software  activity  for  program 
analysis. 

A  detailed  profile  analysis  for  two  similar  FORTRAN  FFT  programs 

was  Joik-  using  the  a<  i.  i  v  i  ty  proiile  generator  program.  A  bit  reversal 
sure  i  n  i a  -er  i  i  !'■  r  i  '.  ■  - ■  -  '  ■  •  .  !’  '■ ! >  i  .  -  '  :  e r. „•  : 
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program  was  incorporated  as  a  subroutine  call  in  the  FFT  programs  to 
demonstrate  the  feasiblity  of  improving  program  execution  tine  using 
microprograms.  N'o  special  attempt  was  made  to  implement  the  fastest 
or  most  accurate  microcode.  The  programs  were  tested  and  the  improve¬ 
ment  in  execution  tine  determined  and  compared  with  that  predicted 
from  the  activity  profile  analysis. 

The  limited  time  available  for  this  project  did  not  permit  ex¬ 
plicit  investigation  of  some  important  tradeoff  options  such  as  pro¬ 
gram  accuracy,  word  length  considerations  and  interrupt  status.  The 
reader  should  be  aware,  however,  such  tradeoff  variables  are  extremely 
important  considerations  for  designing  microprograms  to  optimize  ex¬ 
ecution  response,  especially  for  time  critical  applications.  For  ex¬ 
ample,  checking  interrupt  status  is  a  primary  responsibility  the  micro- 
programmer  must  consider  when  writing  a  microprogram  (Ref.  8:7-21). 
Interrupts,  of  course,  affect  the  speed  of  the  microprogram.  If  the 
interrupt  rate  is  too  high  the  speed  benefits  offered  by  microprogram¬ 
ming  may  be  lost. 


Sequence  of  Presentation 

This  report  documents  the  results  of  an  investigation  conducted 
during  the  spring  and  summer  quarters  of  1981  at  AFIT  to  implement  a 
microprogramming  capability  on  AFIT's  HP  21MX  computer.  Chapter  I 
has  presented  the  objectives  and  background  requirements  for  under¬ 
taking  the  investigation.  In  order  to  put  this  report  into  context 
Chapter  II  presents  a  brief  overview  of  the  microprogramming  concept, 
writable  control  store  and  the  activity  profile  concept.  Chapter 
III  discusses  tile  survey  of  the  electronic  kmc;'  ire  programs  examined. 
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preseats  the  general  criteria  used  to  select  several  programs  for 
analysis  with  an  activity  profile  generator  and  presents  the  analy¬ 
sis  results  which  led  to  the  selection  of  the  two  FFT  programs  for 
demonstrating  the  microprogramming  capability.  However  before  any 
microprogramming  could  actually  be  done  in  this  study  it  was  first 
necessary  to  put  together  AFIT's  21MX  computer  system.  Chapter 
IV  briefly  discusses  the  implementation  phase,  the  system  genera¬ 
tion  required  and  Che  HP  microprogramming  support  software  avail¬ 
able.  Then  in  Chapter  V  the  design  and  test  of  an  FFT  bit  rever¬ 
sal  algorithm  in  microcode  is  considered.  Finally  in  Chapter  VI 
the  results  and  conclusions  of  the  study  are  presented  and  recom¬ 
mendations  for  further  development  are  made. 
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I I .  Microprogramming  Concept 

Introduction 

Microprogramming,  in  general,  is  a  very  powerful  and  useful  tool. 
This  chapter  establishes  that  microprogramming  has  the  unique  capabil¬ 
ity  to  significantly  reduce  execution  time  for  a  computer  program.  Al¬ 
though  other  uses  of  microprogramming  exist,  the  first  section  briefly 
explains  how  microprograms  provide  a  speed  gain  to  decrease  overall 
software  program  execution  time.  Microprograms  usually  reside  in  a 
special  memory  in  the  computer  control  section  called  a  writable  con¬ 
trol  store.  The  second  section  emphasizes  some  of  the  important  pro¬ 
perties  of  the  writable  control  store  as  a  system  resource.  The  next 
section  higlights  some  of  the  advantages  and  disadvantages  of  micro¬ 
programming  and  its  application  to  computers  in  general  as  well  as  the 
HP  21MX  computer.  To  apply  microprogramming  techniques  effectively,  one 
needs  to  know  where  and  what  to  microprogram.  The  fourth  section  dis¬ 
cusses  one  useful  tool,  the  activity  profile,  for  determining  where  to 
apply  microprogram  techniques.  The  last  section  covers  the  activity  pro¬ 
file  concept  and  how  to  generate  a  profile. 

How  Microprogramming  Provides  a  Speed  Gain 

In  general,  microprogramming  provides  a  speed  gain  from  the  follow¬ 
ing  five  sources: 

1.  Elimination  of  instruction  fetches. 

2.  Faster  memory  components. 

3.  Extra  storage  registers. 

4.  Parallel  process  in ■». 

5.  Morr*  if  *  vt  • 
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With  microprogramming  a  user  ran  eliminate  memory  fetch  instruc¬ 
tions  in  a  software  program  by  combining  a  series  of  machine  instruc¬ 
tions  into  one  single  microprogram.  Usually  in  a  software  program 
each  machine  instruction  requires  a  fetch  from  main  memory.  In  con¬ 
trast,  only  one  fetch  from  main  memory  is  required  to  execute  the 
single  microprogram.  Since  a  memory  fetch  takes  approximately  35%  to 
45%  of  the  CPU  time  (Ref  2:11),  the  overall  software  program  execution 
time  is  decreased  in  direct  proportion  to  the  number  of  machine  instruc¬ 
tions  replaced.  Thus  a  significant  time  savings  can  be  realized  by  this 
factor  alone. 

The  other  factors  listed  above  also  contribute  to  additional  in¬ 
creases  in  speed.  Microinstructions  reside  in  a  special  high  speed  mem¬ 
ory  in  the  control  store  section  of  the  computer.  The  control  store  mem¬ 
ory  may  be  either  Read  Only  Memory  (ROM)  or  special  Writable  Control 
Store  (WCS)  memory.  It  is  important  to  note  that  this  control  store  mem¬ 
ory  is  independent  of  the  main  memory.  The  cycle  time  required  to  ac¬ 
cess  the  control  memory  is  2  to  5  times  faster  than  for  main  memory. 

User  microprograms  can  reside  in  either  WCS  or  be  "burned"  into  non¬ 
volatile  PROM  chips  which  can  be  installed  on  User  Control  Store  (UCS) 
boards.  The  microprogrammed  computer  instruction  set  for  the  HP  comput¬ 
er  for  example  is  burned  into  a  sot  of  ROM  chips  and  reside  on  a  special 
board  mounted  below  the  motherboard.  As  a  result  of  using  faster  memory 
components  to  store  microprogram  instructions,  efficiency  is  increased 
2  to  5  times  that  of  calling  individual  machine  instructions  according 
to  Hewlett  Packard. 

In  addition  to  the  control  store  being  more  than  twice  as  fast  as 
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main  memory,  microinstructions  have  access  to  many  internal  high 
speed  scratch  pad  registers  that  main  memory  machine  instructions 
cannot  use.  What  this  means  is  that  having  more  scratch  pad  regis¬ 
ters  will  result  in  fewer  main  memory  fetches  and  thus  achieve  faster 
execution  (Ref  9:8). 

Each  microinstruction  word  in  the  HP  21MX  is  24  bits  wide  verses 
16  bits  for  the  software  machine  instruction.  The  24  bit  words  are 
divided  into  fields  which  directly  define  the  control  steps  to  be 
executed  within  the  system.  With  the  2LMX  and  other  microprogrammable 
machines  each  field  performs  a  different  micro  operation,  often  inde¬ 
pendently  of  one  another.  Consequently  many  different  operations  can 
be  performed  in  parallel  to  maximize  efficiency.  This  is  in  contrast 
to  the  machine  instruction  which  can  usually  only  perform  a  single 
operation.  It  is  important  to  observe,  however,  that  each  operation 
performed  by  a  machine  instruction  is  actually  a  series  of  microin¬ 
structions  which  defines  the  operation.  The  16  bits  making  up  the 
machine  instruction  in  fact  defines  the  starting  location  of  a  micro¬ 
program  which  performs  the  machine  operation. 

Finally  more  arithmetic  and  logical  functions  are  available  on 
the  microcode  level  which  main  memory  programs  cannot  use.  This 
makes  it  possible  to  take  full  advantage  of  the  machines  architec¬ 
ture,  internal  registers  and  logical  features  to  more  precisely  match 
the  implementation  of  a  program  operation  or  algorithm.  Consequently 
program  efficiency  can  be  optimized. 

Writable  Control  Store  as  a  System  Resource 

in  tin'  previous  section  it.  was  explained  I  ii.it  m  i  o  ran  rog  rutmu  i  ng 
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achieves  a  speed  gain  through  the  use  of  faster  memory  components,  i.e. 


either  ROM  or  Writable  Control  Store  modules.  Because  "...  many  users 
do  not  fully  understand  what  WCS  is,  what  it  can  do,  and  most  important, 
what  it  means  to  them."  (Ref  10:16)  a°  a  user,  this  section  will  high¬ 
light  some  of  the  properties  of  the  WCS  as  a  system  resource. 

First,  WCS  is  simply  a  high  speed  memory  device.  It  is  independent 
of  the  main  computer  memory  system.  It  is  used  to  hold  user  micro¬ 
instructions.  In  contrast,  macroinstructions  (machine  language  instruc¬ 
tions)  always  reside  in  main  memory.  Usually  the  WCS  word  is  much  wider 
than  a  main  memory  instruction  word. 

Second  WCS  is  an  I/O  device.  What  this  means  is  the  computer  can 
perform  input  and  output  operations  to  the  WCS  device  like  any  other  I/O 
device.  This  permits  storing  user  microprograms  in  main  memory  or  on 
disk  files  and  later  loading  them  into  the  WCS  as  required.  This  makes 
it  possible  to  alter  the  computer  operations  dynamically  as  a  main  mem¬ 
ory  program  executes  if  a  WCS  option  is  installed. 

The  WCS  board  is  simple  to  install  in  the  computer.  It  fits  into 
an  I/O  slot  provided  in  the  printed  circuit  assembly  (PCA)  cage  of  the 
computer. 

Third,  one  important  factor  which  must  be  weighted  befored  using 
a  WCS  for  program  production  or  control  tasks  is  the  fact  that  the  WCS 
memory  is  volatile.  If  power  fails  in  the  computer  the  instructions  in 
WCS  will  be  lost.  Thin  could  result  in  a  dangerous  si  tint  ion  if  the 

computer  is  controlling  a  hazardous  operation.  Consequently  this  factor 
of  volatility  must  be  weighed  with  the  application  when  deciding  whether 
to  use  ROM  or  WCS . 

Fourth,  adding  more  WCS  will  improve  the  computers  capability  and 
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flexibility.  WCS  is  available  for  the  HP  system  in  various  sizes  either 
a  256  word  board  or  a  lk  word  board.  Several  boards  may  be  installed  in 
the  computer  system.  This  means  longer  and  more  complex  programs  can  be 
accommodated  but  also  implies  longer  execution  of  a  microprogram  will 
result.  This  will  make  it  necessary  then  to  consider  the  problem-of 
program  interrupts  when  designing  long  microprograms.  Fortunately  most 
microprograms  are  relatively  short. 

The  writable  control  store  is  the  key  to  user  microprogram  develop¬ 
ment  in  the  HP  21MX  RTE  operating  system.  With  the  WCS  option  installed 
the  user  can  load,  execute  and  debug  his  microprograms  on-line  while 
operating  with  RTE  system.  The  WCS  also  allows  improved  interaction 
with  the  RTE  editor,  disk  file  manipulations  and  so  on  for  easier  micro¬ 
program  development  (Ref  11:15). 

Further,  in  HPs  real  time  multiprogramming  environment  many  uers 
can  access  a  single  set  of  microprograms  resident  in  WCS  or  multiple 
user  programs  can  use  the  same  WCS  area  and  sequentually  load  in  differ¬ 
ent  microprograms.  And  finally,  if  more  than  one  WCS  module  is  avail¬ 
able  each  module  can  be  used  independently.  This  permits  a  structure 
where  a  combination  of  separate  and  identical  microprograms  can  be 
accessed  by  multiple  users  (Ref  11:16). 

It  should  be  pointed  out  however  that  the  sharing  of  WCS  can  pose 
some  major  coordination  problems  (Ref  12:21). 

< 

Advantages  and  Disadvantages  of  Microprogramming 

Some  of  the  advantages  offered  by  microprogramming  are  as  follows: 

1.  Speed  enhancement 

2.  Flexibility 

2.  T  ; :  '.'t-  ’  <:>  i 
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4.  Vertical  microinstructions  allow  easy  microprogramming 

5.  Instruction  enhancement 

6.  Special  instructions 

7.  More  control  than  assembly  language 

8.  Excellent  software  support  tools  offered  by  HP 

9.  Dynamic  user  microprogramming  capability 

The  speed  enhancement  which  microprogramming  provides  has  been 
discussed.  But  microprogramming  offers  more  than  just  a  speed  advan¬ 
tage  as  seen  from  the  list  above.  Microprogramming  also  permits  a 
great  flexibility  in  both  the  design  of  computers  and  in  the  definition 
of  the  computers  base  instruction  set.  With  a  microprogrammed  computer 
this  flexibility  permits  the  freedom  to  tailor  the  insturction  set  for 
special  instructions  or  to  enhance  existing  instructions  as  new  or  bet¬ 
ter  algorithms  become  available.  In  addition  the  HP  21MX  computer  is 
user  microprogrammable  which  means  anyone  can  use  this  feature;  it  is 
not  just  limited  to  the  vendor.  But  more  importantly,  the  HP  micro¬ 
programming  language  uses  a  vortical  or  diagonal  (Ref  13:14)  micro¬ 
instructions.  This  type  of  microinstruction  is  simpler  to  use  and 
makes  the  HP  microprogramming  language  relatively  easy.  To  aid  user 
microprogramming  development,  various  software  support  tools  are 
uaually  necessary.  Hewlett  Packard  solves  this  by  providing  a  pow¬ 
erful  debug/editor  program. 

Some  of  the  disadvantages  offered  by  microprogramming  are  as  fol¬ 
lows: 

Time  to  write  microprograms  -  not  for  day  to  day  programming 

Documentation  and  maintenance  may  be  difficult 

Easy  to  crash  system  -  best  to  debug  on  a  dedicated  system 


Software  not  machine  transportable 

Verification  procedure  is  difficult 

The  above  disadvantages  of  microprogramming  are  mostly  self- 
explanatory.  Instead  of  stepping  forward  to  an  easier,  higher  ordered 
and  more  structured  language,  the  user  is  actually  taking  a  step  in  the 
opposite  direction  to  an  almost  unstructured  language  in  which  many 
events  can  take  place  in  parallel  and  at  random  times.  This  makes 
microprogramming  orders  of  magnitude  more  difficult  to  write,  document 
and  maintain. 

Becuase  microprograms  control  the  basic  machine,  generally  at 
the  gate  level,  a  great  potential  exists  for  making  disastrous  errors 
which  could  literally  destroy  any  operating  system,  files  and  so  forth 
on  the  machine.  As  a  consequence,  it  is  wise  not  to  develop  micropro¬ 
grams  on  a  sys  :m  which  is  running  production  ^obs  or  controlling  hazar¬ 
dous  applications. 

Usually  microprogramming  languages  are  computer  specific,  even 
among  the  same  computer  types.  This  means  microprograms  developed  on 
one  machine  may  not,  necessarily,  work  on  another.  Likewise  software 
programs  that  use  the  microprograms  will  not  work  on  other  machines. 
Because  of  the  many  interactions  possible  in  microprogramming ,  verifi¬ 
cation  of  a  microprogram  is  difficult.  Until  better  tools  are  developed 
to  test  microprograms  this  will  be  an  area  which  will  hinder  user  micro¬ 
programming. 

Some  of  the  limitations  of  the  HP  21MX  are  as  follows: 

1.  Speed  .  320  ns  cycle  time 

2.  Re.ul/Wr  ito  .  Takes  two  cycles 

3.  WHS  .  I. ini  r*d  : '  no 
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In  the  21MX  M-series  computer,  the  basic,  microinstruction  time 
takes  325  ns  to  complete.  This  is  the  primary  limiting  factor  on 
speed  for  real-time  applications.  Another  limiting  factor  occurs 
when  reading  or  writing  from  main  memory.  Two  microinstruction 
cycles  are  required  for  either  a  read  or  write  operation.  Most 
microprograms  tend  to  be  short  so  a  large  WCS  is  not  always  required. 
However  as  applications  grow,  the  size  of  microprograms  will  likely 
grow  also.  The  M-series  is  limited  by  the  WCS  area  it  has  available. 
About  2k  words  of  WCS  are  available  for  user  microprogramming  in  the 
M-series.  This  may  not  be  sufficient  area  for  large  user  micropro¬ 
grams  especially  those  which  use  complex  operations  and  floating  point 
math. 

Analyzing  Where  to  Microprogram 

Having  identified  the  reasons  why  microprogramming  can  speed  up  a 
program  and  understanding  what  WCS  is, and  can  do  for  use  we  now  turn  to 
applying  these  methods. 

Theoretically  one  would  want  to  write  an  entire  application  pro¬ 
gram  in  microcode  to  achieve  maximum  performance.  Suctt  a  task  would 
be  huge.  Since  the  available  WCS  area  in  most  mic reprogrammable  com¬ 
puters  is  relatively  small,  the  task  would  also  be  impractical. 

An  alternative  solution  is  to  microprogram  portions  of  software 
which  are  frequently  used.  This  is  the  microprogramming  concept  of 
tuning  and  involves  analyzing  the  application  program  to  determine  * 
areas  which,  if  replaced  with  microcode,  will  result  in  significant 
savings  of  overhead.  It  is  often  stated  that  90  percent  of  a  program 
execution  time  is  sent  in  LO  percent  of  the  .ictuaJ  software  code  (i!ct 


15 


2:7,  Ref  14:83-87,  Ref  13:56).  If  this  is  true  then  significant  sav¬ 
ing  in  execution  time  can  result  without  expending  a  great  deal  of 
microprogramming  time.  It  also  imples  microprograms  will  be  short 

thus  reducing  the  amount  of  effort  spent  in  writing  the  code.  But  '• 
this  leads  to  the  real  problem  of  identifying  the  software  code  which 
is  limiting  the  programs  performance. 

How  does  one  easily  identify  inefficient  code,  for  example  in 
large  software  programs?  Determining  what  section  of  a  program  is  ac¬ 
tually  inefficient  is  not  always  an  obvious  task  (Ref  11:15).  In  the 
area  of  program  timing  quite  often  one  is  mislead  by  intution  (Ref  12: 

18-19).  A  portion  believed  to  be  taking  the  most  execution  tine  may  not 
actually  be  doing  so.  Determiniation  of  what  to  microprogram  in  a  speci¬ 
fic  application  program  is  a  function  of  many  things.  The  process  gen¬ 
erally  requires  a  critical  analysis  of  the  programs  activity  to  deter¬ 
mine  inef ficiencies  accurately .  For  purposes  of  analysis  several  methods 
exist : 

*  Measure  system  time  using  a  watch  or  the  computers  built  in  clock 
to  time  the  various  sections. 

*  Use  a  special  hardware  tool  known  as  a  breakpoint  register  such  as 
the  HP  10676A. 

*  Cenerate  an  activity  profile  of  the  program. 

The  first  method  is  simple  but  may  require  considerable  insight 
into  wh.it  to  measure  since  ail  time  informal. iuu  is  reduced  lo  a  single 

number. 

The  second  method  is  extremely  useful  and  accurate  but  requires 
special  hardware  be  used  (Ref  12:99). 
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The  third  method  is  the  most  useful.  All  activity  profile  is  simply  a 
table  of  frequency  counts  of  a  computers  program,  address  counter  (P- 
register).  This  data  is  plotted  to  show  the  percentage  of  time  a  pro¬ 


gram  spends  in  each  area  of  memory.  Snook  in  his  paper  (Ref  14:33-57) 
remarks  that  a  profile  is  useful  for  several  reasons.  One  is  to  locate 
the  areas  where  a  program  spends  most  of  its  time.  Two,  it  identifies 
those  areas  which  use  very  little  time  (therefore  improving  these  areas 
is  usually  a  waste  of  tine).  Finally,  one  gains  a  better  insight  into 
the  program  being  used. 

Snook,  in  addition,  points  out  the  profile  generator  helps  reduce 
the  amount  of  guess  work  involved  in  optimizing  a  program  by  pinpoint¬ 
ing  the  bottleneck  areas  which  may  require  a  simple  recoding  or  rede¬ 
sign  of  an  algorithm  to  make  it  more  efficient. 

Activity  Profile  Concept 

Several  ways  exist  to  generate  an  activity  profile.  The  simplest 
method  is  to  just  halt  the  computer  a  number  of  times  while  the  pro¬ 
gram  is  executing  and  tabulate  the  addresses  of  the  p-register.  The 
drawbacks  to  this  method  are  obvious. 

A  second  method  is  to  use  the  computer  itself  to  monitor  and 
collect  the  execution  activity  of  a  program  while  it  is  running.  For 
this  purpose  a  simple  program  called  an  ACTIVITY  PROFILE  CFAiERATOR  is 
often  used.  This  is  a  software  program  which  periodically  interrupts 
the  program  being  monitored  and  simply  checks  the  p-register  address 
and  stores  the  results  for  latter  analysis.  Usually  the  address  range 
of  the  program  can  be  specified  which  also  allows  the  user  to  zoom  in 
on  specific  areas  ol  the  program  which  may  be  of  high  interest. 
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When  the  users  program  ends  the  activity  profile  data  can  then  be 
analyzed  by  means  of  a  histogram  plot  or  cummulative  probability  plot 
of  the  data.  By  comparing  the  output  results  with  the  loader  address 
map  for  the  program  one  can  readily  isolate  areas  of  high  and  low  acti¬ 
vity  in  the  program.  As  a  result,  the  profile  analysis  will  indicate 
what  sections  of  source  code  can  be  modified,  reprogrammed  in  assembly 
language  or  reprogrammed  in  microcode  to  improve  execution  response 
time. 

Summary 

This  section  introduced  the  concept  of  using  microprogramming  as 
a  useful  tool  for  improving  the  execution  speed  of  a  program.  How 
microprogramming  achieves  a  speed  gain  was  covered  and  some  advantages 
and  disadvatages  of  microprogramming  were  outlined.  The  accivity  pro¬ 
file  concept  is  a  useful  tool  for  analyzing  a  program.  It  can  be  used 
to  advantage  especially  for  finding  out  the  inefficient  software  por¬ 
tions  of  program.  Once  the  inefficient  software,  is  identified,  it 
can  be  microprogrammed  resulting  in  an  improvement  in  overall  execution 
time  for  the  program. 
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COMPUTER  PROGRAM  SURVEY 


III.  ELECTRON' IC  WAR FACE 

Introduction 

Before  any  microprogramming  effort  could  be  undertaken  for  this 
study  it  was  first  necessary  to  survey  the  spectrum  of  EW  programs 
available,  on  the  EWAE  facility.  The  approach  taken  was  to  compile  a 
list  of  the  available  candidate  EW  simulation  models,  post-data  pro¬ 
cessing  and  digital  signal  processing  programs  being  planned  or  cur¬ 
rently  operating  on  the  EWAF  facilities  computer.  Then  from  this  list 
chose  one  or  more  programs  for  a  more  detailed  analysis  to  determine 
those  areas  that  could  be  microprogrammed  for  improved  performance. 

As  discussed  earlier  the  EWAF  is  a  multi-user  computer  facility 
used  primarily  to  support  a  wide  range  of  SPO  related  EW  evaluation 
and  analysis  tasks.  In  general  the  following  areas  are  supported: 

-  EW  radar/missile  simulation  models 

-  EW  test  data  preparation  and  post-data  processing 
Statistical  and  regression  analysis  programs 

-  Real  time  signal  processing  and  data  acquisition  tasks 

At  the  present  time  the  real  time  signal  processing  and  data  acqui¬ 
sition  capability  is  still  in  a  planning  stage  and  is  virtually  non  exis- 
tant.  Suitable  programs  in  the  following  areas  are  desirable  candidates 
for  building  up  the  EWAF  real  time  signal  processing  capability: 

A/D  conversion  Logical  d.  vision  making 

Filter  programs  Kalman  filters 

Integrators  Max.  entropy  filters 

Correlators  Pattern  recognition 

Threshold  detection  Fnurf’r  transforms 

.11;,..  .i  1  .'..llli.  illilU;,  Vtllsil  L  I.llUilTtlS 
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The  following  criteria  was  identified  as  a  means  of  selecting 
several  programs  from  the  survey  list  for  a  more  detailed  analysis: 

-  Due  to  the  limited  time  available  for  this  study  the  pro¬ 
grams  selected  for  futher  analysis  were  limited  to  FORTRAN 
programs  currently  running  on  the  EWAF  2LMX  computer.  As  a 
result,  many  of  the  larger  EW  simulation  programs,  such  as 
MPASS  or  MECCA,  that  currently  run  on  the  base  CDC  computer 
system  were  excluded. 

Improving  the  execution  time  of  a  program  was  the  primary 
emphasis  for  using  microprogramming  in  this  study.  Ic  fol¬ 
lows  that  programs  which  take  excessive  time  to  run  on  the 
EWAF  should  be  considered  prime  candidates. 

Programs  that  can  share  common  code  algorithms,  data  struc- 

S' 

tures,  sorting  routines  etc.  were  considered  next.  Such 
programs  would  permit  writing  modular  microprograms  that 
can  be  used  by  other  routines,  requiring  only  a  minimum  of 
interfacing. 

Programs  which  are  used  often,  especially  in  a  multiprogram¬ 
ming  environment  were  included.  Frequently  used  programs 
should  be  microprogrammed.  This  would  possibly  permit  speed¬ 
ing  up  the  overall  computer  system  response  in  a  multiprogram 
environment  as  well  as  the  individual  program. 

-  The  HP  RTF  operating  svston  software  and  utility  programs 

available  on  the  EWAF  were  excluded  from  the.  survey;  Source 
code  for  these  programs  was  unavailable.  Further  special 
care  in  writing  microprograms  for  the  system  software  must 
be  exercised  so  as  not  to  void  the  vendors  maintenance  and 
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Any  program  selected  should  be  reasonably  simple  to  modify 
and  interface  to  the  microprograms.  This  criteria  suggests 
looking  at  programs  which  show  high  clustered  activity  areas, 
so  that  microprograms  can  be  written  in  modular  forms.  Pro¬ 
grams  having  many  subroutine  blocks  could  be  potential  candi¬ 
dates. 

-  Any  programs  selected  should  have  a  simple  set  of  input 

data  available  to  check  programs  before  and  after  micropro¬ 
gram  improvement  and  to  ease  the  burden  of  learning  to  run  the 
program  for  this  study. 

Programs  Surveyed 

The  list  of  candidate  programs  considered  for  this  study  is  given 
in  Appendix  A.  The  actual  FORTRAN  coding  for  the  programs  are  not 
presented  in  this  report.  Program  details  related  to  these  programs 
are  minimized  for  two  reasons: 

1.  Comprehension  of  the  activity  profile  results  does  not 
require  an  understanding  of  the  program  code. 

2.  It  is  beyond  the  scope  of  this  effort  to  become  an  ex¬ 
pert  on  the  FORTRAN  content  of  these  programs. 

From  this  list  of  programs  the  next  problem  was  to  select  one  or  two 
for  a  detailed  activity  profile  analysis. 

Conor.il  Criteria  for  .selection 

The  primary  emphasis  for  improving  program  performance  in  this 
initial  investigation  is  program  execution  speed.  Other  performance 
improvements  using  microcode,  such  as  accuracy  requirements,  wore  dis¬ 
cussed  in  Chapter  II  but  are  not  considered  here. 
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Activity  Profile  Experiments 

Using  the  criteria  above,  activity  profile  experiments  were  run 
for  a  variety  of  programs  selected  from  the  survey  list.  The  follow¬ 
ing  programs  were  initially  selected: 

-  BESSEL  function  program 

-  WEDGE  program 

CROSS  CORRELATION  PROGRAM  —  &  CROSO 

AUTO  CORRELATION  £,  SPECTRIAL  ANALYSIS  —  &  AUTSP 

STEPWISE  REGRESSION  --  &  STEPR 

DOWNLINK  JAMMING  PROCRAMS 

FFT  PROGRAMS 

The  BESSEL,  STEPWISE  REGRESSION,  CROSS  CORRELATION,  and  AUTO  COR¬ 
RELATION  programs  were  selected  on  the  bases  of  frequency  of  use  and 
potential  usefulness  for  improving  the  EW  signal  processing  capability. 

The  DOWNLINK  JAMMING  and  WEDGE  programs  were  chosen  because  of 
execution  time,  frequency  of  use,  and  large  number  of  subroutines  used 
in  the  programs. 

The  FFT  programs  were  selected  for  their  commonality  properties  and 
their  potential  as  basis  of  other  digital  signal  processing  programs 
such  as  power  spectrum,  computing  ambiguity  functions,  and  periodograms 
and  real  time  applications. 

An  activity  profile  generator  program  (ACTV),  discussed  below,  was 
used  to  obtain  the  profiles.  For  each  program  examined  the  following 

tasks  were  performed: 

-  Obtain  the  necessary  documentation  and  input  data,  if 
available,  for  running  the  program. 

-  Compile  each  program  to  obtain  a  mixed  FORT RAN /ASSEMBLY 

1  /;.i go  :  i :  .L  i  i\; ,. 
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-  Load  each  program  to  get  a  listing  of  the  loader  ad¬ 
dress  map. 

Run  the  ACTV  program  simultaneously  with  the  program 
being  monitored. 

-  Obtain  a  printout  of  the  activity  profile  results  and 
compare  with  the  load  map  and  FORTRAN  listings  of  the 
monitored  program  to  determine  the  software  areas  which 
could  possibly  be  microprogrammed  to  decrease  execution 

time. 

Although  the  tasks  described  above  are  straight  forward  the  time  re¬ 
quired  to  set  up  the  programs,  schedule  computer  time,  obtain  the  nec¬ 
essary  documentation  and  input  data  made  it  impractical  to  examine 
more  of  the  surveyed  programs. 

A  major  set  back  occured  after  obtaining  some  initial  profile 
results  for  these  programs.  The  EUAF  21MX  computer  developed  an  unex¬ 
plained  problem  which  caused  several  disk  files  to  be  overwritten  and 
also  destroyed  the  RTE-IVB  operating  system.  The  activity  profile 
generator  program  (ACTV)  was  one  of  the  programs  lost.  In  addition, 
the  ACTV  program  was  initially  thought  to  have  been  the  cause  of  the 
system  crash  since  several  files  were  overwritten  with  portions  of 
the  ACTV  program.  The  ACTV  program,  however,  was  cleared  and  it  was 
determined  that  the  problem  was  actually  in  the  EWAF  hardware.  Sev¬ 
eral  weeks  passed  before  the  ACTV  program  was  restored  on  the  EUAF 
system. 

At  this  point  a  decision  was  made  to  analyze  the  profiles  already 
obtained  and  based  on  those  results  choose  one  program  to  examine  in 
greater  detail  for  a  possible  feasibility  study. 
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Implementing  an  Activity  Profile  Generator 


Implementing  an  activity  profile  generator  program  to  analyze 
the  selecced  programs  presented  some  problems.  First,  it  was  dis¬ 
covered  that  the  HP  activity  profile  software  available  (Ref  16)  was 
designed  to  operate  with  RTE  III  and  would  not  run  under  the  EWAF'-s 
RTE-IVB  operating  system.  Further,  it  did  not  execute  properly 
when  tried  on  AFIT's  RTE-III  system.  The  program  ran  but  the  results 
were  obviously  wrong. 

Hewlett-Packard  sells  an  activity  profile  package  (Ref  17)  for 
the  RTE-IVB  system  but  this  solution  was  both  impractical  and  costly. 

An  alternative  solution  was  found  after  talking  with  Mr.  Jim 
Leonard  (Ref  18)  AFWAL/AARF .  He  wrote  an  activity  profile  generator 
for  his  RTE-IVB  system  and  provided  a  copy  of  his  profile  (ACTV)  pro¬ 
gram  for  use  on  the  EWAF  system.  A  listing  of  the  activity  profile 
(ACTV)  program  is  provided  in  Appendix  B.  The  ACTV  program  is  similar 
to  HP’s  version  but  is  simplier  to  use  and  does  not  require  any  modi¬ 
fications  bo  made  to  the  program  being  monitored. 

The  activity  profile  program  was  successfully  implemented  on  the 
EWAF  computer  but  it  would  not  execute  on  AFIT's  RTE-III  system  be¬ 
cause  it  lacked  a  system  subroutine  provided  for  in  the  RTE-IV  system. 
It  is  believed  it  would  not  be  difficult  to  write  the  required  sub¬ 
routine  to  make  this  program  work  on  AFIT's  computer.  This  was  not 
pursued,  however,  at  this  Lime . 

Results 

The  results  of  the  profile  analysis  were  unexpected  as  well  as 

in  format  ive .  First  all  the  pcotiles  examined  exhibited  a  cemT.on  char— 

ir  1 1*  rf  ;  r  i  ■  ■ .  iA,-1)  nr  >i  Mi1  show:-. I  that  the  i  i  t  v  ol  .ativilv  in  the 

programs  examined  occured  in  the  RTE  system  library  subroutines.  This 
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overhead  activity  amounted  to  about  80%  to  90%  of  the  total  activity 
of  the  programs  examined.  The  major  offending  library  routine  was 
the  system  REIO  routine.  The  REI0  routine  accounted  for  20%  to  40% 
of  a  programs  total  activity.  The  REIO  routine  is  associated  with 
swapping  programs  in  memory  and  other  I/O  operations. 

Second,  the  operations,  such  as  complex  multiplication  and 
addition,  are  also  system  libraries.  The  profiles  generated  for  a 
monitored  program  do  not  permit seperating  out  the  activity  in  the 
complex  routines  and  the  software  code  which  use  them.  However,  in 
all  cases  examined,  the  activity  in  the  complex  routines  amounted  to 
less  than  20%  of  the  total  activity.  The  activity  generated  by  the 
REIO  routine  and  I/O  routines  wras  significantly  greater  than  the 
activity  generated  by  the  complex  arithmetic  operations. 

Conclusions 

The  following  conclusions  were  formed  from  this  analysis.  First, 
it  was  concluded  it  would  be  difficult  to  significantly  decrease  the 
overall  execution  time  of  the  programs  being  studied  without  micro¬ 
coding  the  program  system  library  routines.  Microprogramming  the  sys¬ 
tem  library  routines  however  is  beyond  the  scope  of  this  study  and 
would  probably  require  extensive  knowledge  of  the  operating  system 
to  achieve  effective  results.  This  type  of  microprogramming  effort 
might  best  be  left  to  the  vendor. 

This  exercise,  however,  clearly  points  out  one  major  area  where 
microprogramming  a  single  system  library  subroutine  such  as  REIO  couid 
perhaps  speed  up  the  entire  operating  system  by  as  much  as  20%  to  40%. 
When  considering  a  nul  tipr>y., running  environment,  supporting  many  users 
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Third,  from  the  activity  profiles  generated  it  is  clear  that  the 
responses  of  non-system  subroutines  in  a  program  might  be  improved 
tremendously  through  the  use  of  microprogramming  but  the  resulting 
decrease  in  execution  time  may  not  have  a  great  effect  on  the  over¬ 
all  program  performance.  In  contrast  if  a  section  of  code  such  as  a 
subroutine  in  a  major  software  loop  which  is  executed  often  the  pro¬ 
file  results  would  show  a  much  greater  activity  in  this  section  of 
code,  and  may  indeed  be  the  principle  contributing  factor  to  the  over¬ 
all  program  response  time.  Microcoding  this  type  of  activity  would  . 
be  very  productive  in  terms  of  time  spent  in  coding  the  routine  as 
well  as  increasing  the  program  speed. 

Finally,  a  decision  was  made  to  examine  the  fast  Fourier  trans¬ 
form  (FFT)  programs  in  more  detail  based  on  the  above  finding  and  after 
a  discussion  with  Mr.  Phil  Douville  (Ref  19).  Mr.  Douville  pointed  out 
a  program  he  was  developing,  which  calls  a  4098  point  complex  FFT  rou¬ 
tine  79  times  and  takes  2  hours  to  run  on  the  21MX  computer.  It  can 
be  seen  that  even  a  small  speed  up  in  Che  FFT  routine  could  signifi¬ 
cantly  reduce  the  time  his  program  takes  to  run. 

The  FFT  is  a  well  known  and  frequently  used  program  which  has 
many  applications  in  the  EW  analysis  area  as  well  as  in  many  real  time 
digital  signal  processing  applications,  A  microprogramming  FFT  firm¬ 
ware  option  is  available  from  Hewlett-Packard  for  their  21MX  computer. 
However,  attempting  to  analyze  the  FFT  program  activity  and  micro¬ 
program  sections  of  the  code  has  merit.  Further  it  can  adequately 
demonstrate  the  process  and  provide  valuable  user  insight  to  micro¬ 
programming  tuning  concepts. 
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Summary 


This  chapter  discussed  the  list  of  EW  programs  available  and 
some  general  criteria  form  which  several  programs  were  chosen  for 
further  analysis  using  an  activity  profile  generator  program.  The 
activity  profile  generator  program,  ACTV,  was  briefly  discussed  and 
the  initial  results  of  running  profiles  against  several  of  the  chos- 
l a  programs  resulted  in  an  unexpected  conclusion.  The  results  in¬ 
dicated  that  the  PTE  system  prgoram  REIO  was  responsible  for  con- 
summing  a  major  portion  of  a  programs  time.  It  was  concluded  that 
the  REIO  utility  routine  would  be  an  excellent  program  for  micro¬ 
programming  but  because  it  was  a  system  program  was  beyond  the 
scope  of  this  effort.  Finally  it  was  concluded  that  the  FFT  pro¬ 
grams  available  offered  the  best  candidate  for  demonstrating  the 
microprogramming  concepts. 
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IV.  MICROPROGRAMMING  IMPLEMENTATION  PHASE 


Introduction 

A  major  challenge  of  this  study  was  to  assemble  the  various 
HP-21MX  system  components  available  at  AFIT  into  a  working  system- 
and  to  generate  an  RTE-III  operating  system  that  supports  user  micro¬ 
programming.  To  accomplish  this  integration,  supplementary  documen¬ 
tation  on  the  HP-21MX  computer  equipment  and  operating  system  had  to 
first  be  obtained.  This  was  an  on-going  effort  throughout  the  entire 
study.  Part  of  this  effort  included  a  base  survey  of  HP  users  to  un¬ 
derstand  how  others  have  set  up  and  used  their  systems.  Before  any 
actual  microprogramming  effort  could  start  it  was  also  necessary  to 
incorporate  the  HP  microprogramming  software  support  programs  into 
the  RTE-III  system.  Likewise,  considerable  effort  was  spent  on 
learning  the  HP  microprogramming  language  for  the  21MX  M-series  com¬ 
puter  and  also  understanding  the  differences  in  the  language  for  the 
various  other  HP  computer  series.  This  chapter  briefly  highlights 
the  21  MX  system  that  was  assembled  and  the  system  generation  per¬ 
formed. 

Literature  and  HP  Documentation  Survey 

Before  undertaking  any  microprogramming  effort  a  thorough  un¬ 
derstanding  of  the  system  architecture  and  the  RTE-III  operating  sys¬ 
tem  for  the  HP-2IMX  computer  had  to  be  achieved.  This  understanding 
was  pursued  during  the  entire  investigation.  It  involved  the  collec¬ 
tion  and  review  of  available  HP  documentation,  a  literature  review 
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of  professional  publications  and  survey  of  facilities  at  WPAFB  using 
HP  computers. 

A  good  portion  of  this  effort  involved  the  identification,  cross 
reference,  and  functional  understanding  of  the  available  HP  system 
components  for  compatiblity  with  an  RTE-III  operating  system.  A  list 
of  the  available  equipment  surveyed  is  included  in  Table  I.  The 
main  result  of  this  effort  permitted  assembly  of  the  overall  AFIT 
computer  system  for  operation  with  an  RTE-III  operating  system.  Fi¬ 
gure  1  shows  a  general  block  diagram  of  the  AFIT  system  developed 
and  the  present  EWAF  system  available  at  ASD/EXAD. 

Base  Survey 

A  base  survey  of  HP  computer  users  was  undertaken.  This  sur¬ 
vey  primarily  consisted  of  personal  interviews  with  the  various  base 
users  of  HP  computer  equipment.  The  following  objectives  were  ac¬ 
complished  by  this  effort: 

-  The  applications  Co  which  HP  computers  were  being 
put  to  use  was  determined. 

-  The  extent  to  which  HP  microprogramming  was  being 
used  was  determined. 

A  useful  source  of  contacts  experienced  with  using 
HP  equipment  was  established. 

A  list  of  the  various  facilities  visited  is  included  in  Appendix  C. 

The  primary  findings  from  this  effort  are  as  follows: 

1.  The  microprogramming  features  of  the  21MX  were  not 
used  dispite  the  fact  that  several  facilities  had 
the  necessary  writable  control  store  (MCS)  option. 

I 
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TABLE  I 


HP  Hardware  Available  for  Project 


HP  Product 
Option  No. 

Description 

2108A 

Computer  ("A"  power  supply) 

2102A 

Memory  Controllers 

12C2B 

STD  Performance  Memory  Controller 

12747A 

128kb.  STD  Performance  Memory  Module 

12892B 

Memory  Project  Module 

12897B 

Dual  Channel  Port  Controller  (DCPC) 

12539C 

Time  Base  Generator 

12731A 

Memory  Expansion  Module 

12945A 

User  Control  Store  (USC) 

12978A 

Writable  Control  Store  (WCS)  256  Words 

12992B 

Disk  Loader  ROM  for  7905/7906/7920  Disk  Unit 

12992C 

CRT  Terminal  Loader  ROM  for  2648A  graphics  terminal 

12566B 

Microcircuit  duplex  register  modules 

12555B 

Dual  8-bit  D/A  converter  nodule 

264SA 

Graphics  Terminal  with  minicartridge  I/O 

13037-80023 

Disk  Interface  Card 

5060-6282 

Papertape  Photoreader  Interface  Card 

12896-60001 

DMA  Board 

12531 

Buffered  TTY  Register 

02100-60060 

Terminator  Boards 

12998A 

16k  STD  Performance  Memory  Modules 

12994a 

8k  STD  IV r  1. o nil. me o  Mcmorv  Modules 
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2.  None  of  the  facilities  were  currently  involved  in 
using  the  HP  systems  for  real  time  digital  signal 
processing  tasks. 

3.  Real  time  data  acquisition  and  control  applications 
were  the  primary  tasks  for  which  the  HP  systems  were 
being  used. 

HP-21MX  Computer  (M-series) 

Before  studying  microprogramming  it  is  important  to  have  a  thor¬ 
ough  understanding  of  the  computers  architecture  and  its  various 
sections.  This  was  achieved  by  studying  the  vendors  documentation 
on  the  computer.  A  brief  description  of  the  21MX  computer  charact¬ 
eristics  and  special  features  of  microprogramming  are  presented  in 
Appendix  D.  Further  detailed  information  is  available  in  HP  service 
and  operating  manuals  (Ref  20,  Ref  21). 

RTE-III  Microprogramming  System  Generation 

To  configure  a  new  operating  system  to  support  the  HP  micropro¬ 
gramming  features  required  running  the  RTE-III  on-line  generator  pro¬ 
gram  RT3GM.  A  typical  system  generation  normally  takes  several  hours 
to  run  depending  on  the  relocatable  programs  that  are  being  added  to 
make  up  the  operating  system.  Unfortunately  the  RTE-III  system  gen¬ 
eration  proved  to  be  more  of  a  challenge  than  what  it  should  have 
been.  Ordinarily,  tin.  task  of  generating  a  new  operating  system  is 
a  relatively  straight  forward  and  simple  procedure  consisting  of 
the  following  steps: 

-  plan  your  system 

-  generate  an  ANSWER  file 

.  i„  r  .t.-r  pr  -0i.  ....  -  ..  L  .... 
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-  run  Che  RTE  system  transfer  program  -  SWITCH 
perform  a  system  backup. 

Initial  attempts  to  run  the  RT3GM  program  failed.  It  took  several 
weeks  of  effort  to  finally  get  the  program  to  successfully  generate 
the  new  operating  system.  Each  time  the  generator  was  run  the  pro¬ 
gram  would  execute  part  way  through  and  then  halt  at  random.  The 
reason  for  the  computer  halting  is  unknown.  An  effort  to  find  out 
why  the  halts  occured,  failed.  Memory  and  system  diagnostic  tests 
were  run  but  the  tests  were  negative.  Nothing  was  found  to  indicate 
anything  was  wrong..  The  author  found  however  that  cleaning  the  contacts 
of  all  the  memory  boards  seemed  to  help  reduce  the  incidence  of  the 
halts.  Also  rearranging  the  order  of  the  relocatable  programs  in  the 
ANSWER  file  seemed  to  help.  It  was  later  learned  that  other  HP  users  on 
base  have  also  had  similar  halting  problems  and  have  not  found  the 
basic  cause.  The  typical  solution  has  been  to  call  in  HP  service 
and  replace  suspected  memory  boards  with  new  ones.  The  major  hard¬ 
ware  and  relocatable  software  modules  which  were  configured  into 
the  system  are  listed  in  Tabl'  II.  The  various  system  assignments 
and  driver  layout  used  for  this  study  in  setting  up  the  RTE-III  micro¬ 
programming  system  configurations  are  listed  in  Table  III.  A  listing 
of  the  ANSWER  file  used  in  this  study  for  the  microprogramming  systems 
configuration  is  included  in  Appendix  E.  The  actual  details  of  gener¬ 
ating  a  new  RTE-III  operating  system  are  contained  in  Hewlett-Packard's 
RTE-II  /RTE-III  on-line  generator  reference  manual  (Ref  22). 

Once  the  RT3CM  program  finally  generated  a  new  system  configura¬ 
tion  output  file,  the  utility  program  SWTCH  was  used  to  transfer 
the  new  operating  system  created  by  the  on-line  generator  program  to 
a  dirt.  ......  .i.i.nn'l.  ..cc  ...lUdll  ,  oi  Ret  dd  lot  Octal  led  tilt  oLi.eiL  loll 

about  the  use  of  SWTC1I. 
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TABLE  II 


AFIT  RTE-III  Mcroprogramming  System  Configuration 


HARDWARE  MODULES 
HP  2108 -Computer 
112k  Main  Memory 
Memory  Protect 
Time  Base  Generator 
DCPC  (Dual  Cahnnel  Port  Controller) 
Dynamic  Maping  System 
HP  7906  Disk  Subsystem 
HP  2548  Graphics  System  Console 
TI  TTY  Printer 
Photo  Reader 
256  Word  WCS 


SOFTWARE  MODULES 

RTE-III  Memory  Resident  System 

RTE-III  System  Library 

RTE-III  Compiler  Library 

Power fail  Driver,  DVR  43 

EDITR  (Interactive  Editor) 

RTE-III  LOADR  (Relocating  Loader) 

MTM  (Multi  Terminal  Generator) 

RT3GN  (RTE-III  On-Line  Generator) 

HP  7906  Disk  Driver,  DVR  32 

RTE-III  WHZAT  Inquary  Program 

HP  Assembler  ..ad  XREF 

RTE  FORTRAN  IV  Compiler 

RTE  FORTRAN  IV  Formatter 

DOS/RTE  Relocatable  Library 

Multi  Terminal  Driver,  DVR  05 

Line  Printer  Dirver,  DVR  12 

Batch  Monitor  Program 

Batch  Monitor  Library 

Memory  Resident  Programs 

Disk  Resident  Programs 

Writable  Control  Store  (WCS) 
Driver,  DVR  36 

RTE  Microassembler,  2  MICRO 

RTE  Micro  Coss  \ssembler,  Z  MXREF 
RTE  Micro  i  >.id  l.'iiliLy,  2  WLOAD 
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After  generating  the  new  system  configuration  an  off-line  system 
backup  was  performed  using  the  HP  utility  program  !DISKl'P.  It  is  a 
wise  practice  to  back,  up  the  operating  system  especially  when  micro¬ 
programming  development  is  being  done.  The  reason  is  that  a  micro¬ 
programming  mistake  can  be  very  costly.  Microprograms  have  complete 
control  over  the  computer,  and  it  would  be  relatively  easy  to  make  a 
mistake  in  programming  which  could  easily  destroy  the  operating  system 
or  other  files.  Having  a  back  up  system  on  a  spare  disk  under  these 
circumstances  is  absolutely  essential.  A  list  of  the  commands  used 
to  run  JDISKUP  is  included  in  Appendix  F.  More  detailed  information 
on  !DISKUP  is  found  in  Ref  23. 

Microprogramming  Software  Support 

One  of  the  most  important  features  of  the  microprogramming  cap¬ 
ability  offered  by  the  HP  system  is  the  development  support  software 
available.  This  software  package  consists  of  six  programs: 


1.  A  MICROASSEMBLER  &MICRO 

2.  A  CROSS  REFERENCE  GENERATOR  &MXREF 

3.  MICRO  DEBUG  EDITOR  &MDEP 

A.  LOADER  PROGRAM  LWLOAD 

5.  PROM  PROGRAM  TAPE  GENERATOR  SPTGF.N 

6.  DRIVER  PROGRAM  6.DVR36 


The  WLOAD  and  DVR36  programs  are  required  to  be  generated  into 
the  system.  The  other  programs  may  be  loaded  at  any  time.  For  the 
system  implemented  at  AFIT  tlu:  MICRO,  MXREF,  VLOAD  and  DVR36  were 
generated  into  the  system.  The  MDEP  program  was  excluded  from  the 
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initial  system  generation  since  the  on-line  generator  program  halted 
several  times  at  this  program.  The  MDEP  program  was  loaded  into  the 
system  after  system  generation. 

Detailed  information  concerning  the  microprogramming  support 
software  is  contained  in  the  following  references  (Ref  21,  Ref  24, 
and  Ref  25) . 

Problems  Encountered 

The  major  problem  encountered  during  the  implementation  and  sys¬ 
tem  generation  phase  was  the  intermittent  and  random  halting  of  the 
computer  whenever  certain  system  programs  were  run.  The  halting 
would  occur  while  trying  to  run  the  FORTRAN  compiler,  assembler  and 
especially  the  On-Line  generator  program  (RT3GN) .  The  reason  for  why 
the  halting  occured  was  not  discovered.  Diagnostic  tests  were  run 
on  the  computer  memory.  No  problems  were  found.  The  memory  appeared 
to  be  working  properly.  However,  it  was  found  that  after  cleaning 
the  memory  board  contacts  the  halting  was  significantly  reduced.  And 
after  having  generated  a  new  system  configuration  for  microprogramming 
the  halting  was  nearly  eliminated  except  when  the  on-line  generator 
program  was  run. 

It  was  learned  that  other  HP21MX  computer  users  on-base  also  have 
had  similar  halting  proble-s.  Their  solution  has  been  to  have  HP 
service  the  computer  and  install  new  memory  boards.  The  underlying 
cause  of  the  halting  however  has  not  been  completely  resolved.  The 
author  personnally  experienced  this  problem  on  the  AFWAL  21MX  com¬ 
puter  running  with  the  RTE-1VB  operating  system.  The  AFWAL  computer 
billed  while  crying  to  use  the  !‘Oil '-N  compiler. 
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Appendix  G  discusses  other  problems  encountered  during  the  implemen¬ 
tation  phase. 

Results 

The  results  of  this  phase  were  positive  with  respect  to  imple¬ 
menting  a  working  RTE-III  system  and  a  working  microprogramming  capa¬ 
bility.  The  WCS  was  installed  and  checked  out.  The  support  software 
was  exercised  and  found  to  work  fine.  In  fact  the  MICROASSEMBLER 
and  other  micro  support  software  seemed  to  work  better  than  the 
FORTRAN  compiler  and  the  ASSEMBLER.  No  problems  were  encountered 
with  using  the  MICRO,  MXREF,  MDEP,  WLOAD ,  and  DVR 3 6  software.  Like¬ 
wise  no  halts  occured  when  using  these  software  features.  The  PROM 
tape  generator  software  was  not  tested. 

Summary 

This  section  briefly  discussed  the  21MX  computer  system  and  the 
system  generation  required  to  implement  the  microprogramming  capa¬ 
bility  on  /JIT's  computer.  The  V.'CS  feature  was  discussed  and  the  HP 
support  software  highlighted.  The  major  problem  encountered  during 
this  phase  was  the  computer  randomly  halted  whenever  system  programs 
were  run.  The  overall  result  of  this  phase  was  the  successful  imple¬ 
mentation  of  a  user  microprogramming  capability  on  AFIT's  HF  21MX 


computer. 


V .  A  MICROPROGRAMMED  EXAMPLE:  FAST  FOURIER  TRANS  FORM 


Rauscher's  Law:  "Microprogramming  an  inefficient  algorithm 
does  not  make  it  efficient"  (Ref.  26)! 


Introduction 

In  chapter  II  it  was  determined  what  to  microprogram  in  an  appli¬ 
cation  program,  is  a  function  of  many  things  and  first  requires  a  crit¬ 
ical  analysis  of  the  programs  activity.  In  this  chapter,  for  purposes 
of  demonstration  and  analysis,  two  similar  radix  2  fast  Fourier  trans¬ 
former  (FFT)  FORTRAN  subroutine  (Ref.  27,  Ref.  28)  are  examined  for 
program  activity.  A  limited  prototype  microprogram  for  performing 
the  bit  reversal  algorithm  is  developed  to  explore  the  feasibility  of 
employing  microcode  to  speed  up  the  FFT  program  execution  time.  Re¬ 
sults  indicate  that  at  least  some  aspects  of  the  microprogramming 
process  can  be  affected  successfully,,  although  futher  research  is 
needed  to  develop  more  comprehensive  procedures  and  to  address  other 
technical  issues. 

Fast  Fourier  Transform  Programs 

Two  fast  Fourier  transforms  programs,  named  FOl'Rl  and  FOl’RE,  were 
chosen  as  suitable  programs  to  demonstrate  the  concept  of  applying 
microprogramming  to  speed  up  execution  response  time.  Both  routines 
perform  the  well  known  Coo  ley- Turkey  ;.ist  Fourier  trail  form  (FtT) 
algorithm  for  a  point  sequence  of  complex  numbers,  x(m);  m=o,  1...,  N-l, 
where  N  must  be  a  power  of  the,  N=2m.  The  F0UR1  subroutine  was  a  pro¬ 
gram  available  on  the  FWAF  svstorn  (Ref.  28).  The  I'Ot'I’F  subroutine  was 
a  program  adapted  from  Radar’s  rev-tor  fRr»f.  27^.  The  hVKF  subroutine 
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was  chosen  because  it  has  a  similar  number  of  FORTRAN  statements  as 
the  F0UR1  program  but  is  coded  differently.  Radar  recommends  the 
FOURE  program  should  be  used  only  for  demonstration  purposes  since 
faster  routines  are  available.  It  was  expected,  then,  the  FOURE  pro¬ 
gram  would  be  less  efficient  than  F0UR1.  If  sc,  it  would  be  inter¬ 
esting  to  examine  this  difference  when  performing  an  activity  pro¬ 
file  analysis. 

Method 

To  checkout  and  test  the  FFT  programs  it  was  convient  to  adapt 
Radar's  main  program  FOURSUBT  and  modify  it  to  include  the  F0UR1  sub¬ 
routine. 

Radar  uses  a  well  known  sequence  and  deterministic  closed  form 
solution  to  test  the  FFT  programs.  The  sequence  he  uses  is: 

X(n)  »  Qn  n  -  0,  1,...,  N-l  (1) 

Where  Q  is  a  complex  constant  0.9  +  j0.3 
and  the  closed  form  solution  is: 

X(k)  =  (iEqwk)  k  =  °.  1.....  N-l  (2) 

with  W  =  e  -  ^  (3) 

This  test  permits  a  simple  check  of  the  FFT  outputs  with  the  data  re- 

5 

suits  published  in  Radar's  paper  for  N=2  and  also  allows  a  comparison 
of  accuracy  obtained  with  the  21MX  computer. 

A  simple,  timing  subroutine  was  added  to  the  main  program  to  get 
the  21MX  system  time  before  and  after  each  FFT  subroutine  was  called 
In  the  main  test  program.  This  permited  a  simple  way  to  determine 
how  long  it  took  each  FFT  subroutine  to  execute.  The  test  program 
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was  also  made  interactive  with  the  H?  graphics  terminal  so  any  num¬ 
ber  of  points  N=4,  8,...,  512  could  be  tested  by  just  entering  the 
value. 

The  programs,  FOURl  and  FOURE,  were  first  checked  out  on  the 
EWAF  RTE-IVB  system  and  then  later  on  the  AFIT  RTE-III  system  after 
it  was  made  operational.  Two  observations  were  made  by  this  base¬ 
line  check  of  the  FFT  programs.  First  it  was  found  the  FOURl  program 
is  about  twice  as  fast  as  the  FOURE  program.  This  faster  response 
was  expected  as  previously  mentioned.  The  second  observation  was, 
however,  unexpected.  The  accuracy  was  better  on  the  EWAF  21MX  (2112) 
computer  using  RTE-IVB  system  than  on  AFIT’s  21MX  (2108)  computer 
using  RTE-III  system.  Table  IV  shows  the  maximum  absolute  differ¬ 
ences  obtained  between  the  theoretical  closed  form  solution  (Eq  2) 
and  FOURE  FFT  values  and  the  maximum  differences  obtained  between 
original  data  sequence  values  (Eq  1)  and  the  Inverse  r'OURE  FFT  data 
for  the  case  of  N  =  2^.  Also  shown  for  relative  comparison  is  the 
results  obtained  by  Radar  on  a  Honeywell  6030X  computer.  The  dif¬ 
ference  in  accuracy  between  the  EWAF  2112  and  AFIT  2108  computers  is 
attributed  to  several  possible  sources: 

Differences  in  the  system  operating  software  since 
the  EWAF  uses  newer  versions  —  Possibly  the  SINE 
routines  are  more  accurate. 

The  instruction  set  in  the  EWAF  i  ;  probably  a  — .vr 

version  than  in  the  AFIT  system  and  may  use  more 
algorithms. 

-  Other  difference'-  in  Operating  system. 

-  Different  FORTRAN  compiler. 
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The  actual  cause  of  these  differences  was  not  pursued  further.  It 
is  not  the  intent  of  this  study  nor  within  the  scope  to  describe 
the  theory  or  possible  applications  behind  the  FFT  algorithm.  This 
theory  is  adequately  covered  in  many  references.  See  for  example 
(Ref.  29,  Ref.  30,  Ref.  31).  Rather  the  goal  is  to  demonstrate 
those  sections  in  the  FFT  program  software  that  could  be  micropro¬ 
grammed  to  increase  response  time. 

FFT  Profile  Analysis 

After  verifying  that  the  FFT  routines  worked,  the  next  step  was 
to  run  the  Activity  Profile  Generator  (ACTV)  program  to  obtain  an  ac¬ 
tivity  profile  of  each  FFT  subroutine.  The  general  purpose  of  an  ac¬ 
tivity  profile  was  covered  in  Chapter  II.  A  brief  discussion  on  how 
to  run  ACTV  under  RTE-IVB  and  a  copy  of  the  FORTRAN  source  listing  for 
the  ACTV  program  is  included  in  Appendix  H. 

Samples  of  the  profiles  obtained' for  the  FFT  routines  both  with 
and  without  the  mlcrocoded  bit  reversal  routine  are  included  in  Appen¬ 
dix  I. 

An  analysis  of  the  FFT  profiles  was  made  by  examining  the  acti¬ 
vity  addresses  with  the  program  load  map  and  copy  of  the  FORTRAN  mixed 
code  listing.  This  led  to  the  general  findings  shown  in  Figure  2. 

It  was  determined  that  the  Bit  Reversal  routine  differed  signifi¬ 
cantly  between  F01JR1  and  FO'JRE  in  terms  of  activity.  The  FOl'Rl  program 
averaged  about  3.6%  total  activity  while  the  FOURE  program  averaged 
about  17%  of  the  total  activity.  The  activity  for  the  butterfly  com¬ 
putations  in  each  program  were  roughly  the  same,  however,  except  for  one 
observation.  On  close  examination  of  the  mixed  FORTRAN  code  listing  for 
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the  FOURE  program,  one  arrives  at  the  conclusion  that  approximately 
18%  of  the  total  activity  was  spent  in  the  execution  of  the  two  CONTIN¬ 
UE  statements  of  label  110  and  120  in  the  program.  The  continue  state¬ 
ments  form  the  butterfly  loops  in  the  FOURE  program.  From  the  activity 
profile  these  two  statements  had  93  and  13  interrupts  respectively • out 
of  a  total  of  584  hits  for  the  entire  subroutine.  Hence  approximately 
18%  of  the  FFT  is  spent  here.  The  F0UR1  software  structure  is  differ¬ 
ent  from  the  FOURE  program  and  does  not  exhibit  this  effect.  The  ob¬ 
servation  to  be  made  here  is  that  microcoding  this  short  section  of  code 
in  the  FOURE  program  could  speed  up  the  FFT  as  much  as  microcoding  the 
entire  bit  reversal  algorithm  which  takes  approximately  17%  of  the  F FT 
time. 

From  Figure  2  it  is  seen  about  20%  of  the  execution  time  for  the 
FOUR1  program  is  spent  computing  the  SINE  vlaues.  Likewise  about  13.5% 
of  the  activity  in  the  FOURE  program  i.s  spent  computing  the  SINE  values. 
The  remaining  activity  shown  In  Figure  2  is  spent  in  other  parts  of  the 
FFT  software  code.  This  overhead  software  was  not  looked  at  in  further 
detail  for  microcoding. 

Predicted  Improvement  via  Microprogram  Enhancement 

The  results  of  the  profile  analysis  shows  rough! y  the  amount  of 

speed  up  one  might  expect  by  microprogramming  the  various  sections  of 
the  "FT  programs.  Thus  for  example  one  might  infer  that  microprogram¬ 
ming  the  software  that  performs  the  bit  reversal  should  speed  up  the 
F0UR1  program  roughly  by  3 '-4%  and  the  FOURE  program  about  18%. 

Mi_c roprog ram  Re cj_i i_ir omen t s 

Now  consider  the  root!  ( remonts  for  m i cronrogrammi ng  the  various  sec- 
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computations  as  such.  Unfortunately  the  results  of  the  FFT  activity 
profile  analysis  indicated  microprogramming  the  Bit  Reversal  software 
will  improve  the  overall  speed  only  3  to  18%,  nevertheless,  it  is  an 
example  of  a  prime  candidate  for  microprogramming. 

On  the  other  hand  consider  the  Butterfly  and  SINE  software.  This 
code  is  mostly  composed  of  complex  multiplications,  additions,  subtrac¬ 
tions  and  the  SINE  calculations.  The  floating  point  multiply  and  add 
and  subtract  routines  are  already  microprogrammed  by  HP  and  are  inclu¬ 
ded  in  the  base  instructions  set.  Nothing  is  gained  by  microprogram¬ 
ming  these  routines  by  the  user.  This  further  suggests  that  it  is  pos¬ 
sibly  a  waste  of  time  to  try  to  microcode  this  section  since  it  is  al¬ 
ready  running  nearly  as  fast  as  microcoding  will  allow.  The  only  excep¬ 
tion  is  in  the  FOURE  program.  Close  examination  of  the  mixed  FORTRAN/ 
ASSEMBLER  listing  showed  that  the  CONTINUE  statements  that  end  the  but¬ 
terfly  code  use  about  18%  of  the  total  time. 

Microprogramming  the  SINE  computations  would  speed  up  the  process 
as  noted  before  but  it  would  also  require  a  very  long  microprogram. 
Hence,  it  is  possible  that  it  would  exceed  che  256  word  WCS  area  avail¬ 
able  for  this  study.  This  estimation  was  based  on  observing  the  length 
of  the  microprogrammed  SINE  computations  in  the  base  instruction  set  for 
the  E  and  F  series  HP-21.MX  computers. 

Another  consideration  is  that  the  21MX  architecture  for  the  M-ser- 
ies  lias  only  one  SWF.  register  in  the  control  section.  The  SAVE  regis¬ 
ter  is  used  for  microsubroutine  jumps.  In  the  M-series  only  one  jump 
is  allowed  in  a  microprogram:  jumps  cannot  be  nested.  This  limits 
how  a  user  can  use  the  base  instruction  set.  The  user  cannot,  for  exam¬ 
ple,  jump  directly  to  the  microprogrammed  floating  point  routines  in  the 
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base  set.  Because  the  floating  point  routines  employ  jumps  to  other 
microprograms  in  the  base  set,  user  microprograms  cannot  access  them 
directly.  This  situation  is  corrected  in  the  HP-21MX  E  and  F  series 
computers  by  providing  three  SAVE  registers.  If  an  E  and  F  series 
computer  were  available  it  would  be  reasonable  that  an  entire  complex 
FFT  algorithm  could  be  coded  into  a  256  WCS  board  using  jumps  to  the 
microprogrammed  floating  point  and  SINE  routines  in  the  base  instruc¬ 
tion  set. 

Design  of  the  Microcode  *• 

As  a  result  of  the  FFT  profile  analysis,  limited  WCS  area  and  for 
purposes  of  demonstration  it  was  decided  to  attempt  to  microcode  the 
bit  reversal  sorting  algorithm.  Figure  3  shows  a  flow  chart  of  the 
overall  algorithm  that  was  microcoded.  The  computation  of  the  FFT  but¬ 
terfly  was  not  microcoded.  This  section  describes  the  design  process 
for  microprogramming  the  bit  reversal, algorithm. 

The  design  was  divided  into  two  distinct  parts:  the  bit  reversal 
algorithm  and  the  complex  floating  point  data  exchange.  A  listing  of 
the  entire  microprogram  developed  is  given  in  Appendix  J . 

The  algorithm  for  forming  the  reversed  index  number  was  adapted 
from  a  microprogram  listing  for  an  integer  FFT  routine  written  for  the 
HP  2100  computer  (Ref.  32).  The  HP  2100  microcode  unfortunately  is  in¬ 
compatible  with  the  HP-21MX  computers  thus  it  required  considerable 
effort  to  learn  the  2100  microlanguage  to  first  make  sense  out  of  the 
algorithm  and  adapt  it  to  the  21MX  microprogramming  language. 

The  complex  data  exchange  section  was  first  microprogrammed.  The 
exchange  section  takes  the  complex  floating  point  numbers  stored  in  the 
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data  array  and  switches  the  values  with  the  bit  reversed  index.  After 
writing  the  exchange  microcode  sec  cion  it  was  assembled  and  debuged  us¬ 
ing  the  HP  microassembler  and  RTE  micro  debug  editor  (MDEP)  software 
utility.  The  microcode  for  the  bit  reversal  was  written  next  and  de¬ 
buged.  After  each  section  was  working  properly  they  were  combined  into 
a  single  microprogram  and  stored  as  a  disk  file.  The  final  step  after 
writing  and  debuging  the  microprogram  using  MDEP  was  to  write  the  nec¬ 
essary  assembly  language  program  to  pass  the  required  parameters  and 
addresses  from  the  FFT  program  and  to  initiate  the  microprogram's  exe¬ 
cution.  The  assembly  interface  routine  is  listed  in  Appendix  J. 


Bit  Reversal  Algorithm 

The  simplest  way  to  bit  reverse  a  number  is  simply  to  perform  a 
right  shift  on  the  number  and  load  it  bit  by  bit  into  a  second  regis¬ 
ter. 


1st.  register 
2nd.  register 


shift  #  right  bit  by  bit 

into  a  second  register 


Unfortunately,  it  is  not  this  simple  to  do  with  the  2LMX  architecture. 
Ideally  it  should  only  take  N  shifts  to  accomplish  the  bit  reversal  for 
an  N  bit  index  number.  But  since  it  is  not  possible  to  do  this  direct¬ 
ly  with  the  2IMX  micro language  an  algorithm  must  be  used  to  perform 
the  bit  reversal. 

Fortunately,  the  algorithm  is  simple  and  easily  programmed  in  micro¬ 
code  using  the  extra  storage  registers  available  to  the  microprogrammer. 
The  algorithm  is  as  follows: 

Step  1.  Put  the  number  to  be  bit  reversed  in  a  scratch  register. 
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Step  2.  Then  examine  the  least  significant  bit  (LSB)  of  the  num- 


Step  3.  If  the  LSB  is  set  then  shift  a  bit  into  a  second 
scratch  register. 

Step  4.  If  the  LSB  is  zero  then  shift  a  zero  into  the  second 
scratch  register. 

Step  3.  Now  repeat  steps  2  through  4  and  continue  in  this  man¬ 
ner  until  all  N  bits  of  the  number  have  been  examined. 

Step  6.  The  bit  reversed  number  will  then  be  in  the  second 
scratch  register. 


Microprogramming  the  Bit  Reversal 


The  microprogram  code  to  perform  the  above  algorithm  is  illustra¬ 
ted  in  Figure  4.  The  microprogram  takes  8  microinstructions.  The 
microinstructions  are  in  a  loop,  however,  which  repeats  the  microcode 
according  to  the  number  of  bits  in  the  number  being  reversed.  The  micro¬ 
program  shown  assumes  the  number  to  be  reversed  is  in  register  S4.  It 
assumes  S2  is  initially  set  to  zero  and  will  contain  the  bit  reversed 
number  when  done.  It  assumes  S3  Initially  contains  the  number  of  bits 
for  the  number  being  reversed. 


Testing  the  Microprogram 

After  writing  the  microprogram  it  was  microassembled  using  the  MICRO 
software  assembler  ami  then  loaded  into  the  L'CS  using  the  MDCP  program. 
The  MDEP  program  was  used  to  change  parameters,  microinstructions,  regis¬ 
ter  values,  and  run  the  microprogram  from  the  system  console  to  test  and 
debug  the  program. 

The  major  problem  experienced  with  testing  and  debuging  the  program 
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run  under  MDEP  if  a  loop  or  faulty  code  occured  or  other  error  was  made 
the  computer  would  hang  up  in  the  microcode.  Microprograms  run  all  the 
time  in  the  HP  machine,  even  after  the  machine  is  halted.  It  was  impos¬ 
sible  to  get  out  of  a  faulty  loop  in  the  microcode  without  turning  the 
power  off.  When  power  is  turned  off  of  course,  the  WCS  loses  the  micro¬ 
program  and  it  must  be  reloaded.  Fortunately  it  is  simple  to  reload 
the  code,  provided  it  was  first  saved  on  a  disk  file.  However  correc¬ 
tions  which  were  made  to  the  program  while  debugging  are  lost  and  will 
need  to  be  restored. 

Microprograms  can  be  loaded  into  WCS  in  several  ways.  Two  ways 
were  tried  during  this  study.  First  the  MDEP  program  can  load  the  micro¬ 
code  stored  on  a  disk  file  into  the  WCS.  This  method  was  primarily  used 
during  the  debug  verification  phases  of  the  program  development.  Any 
microcode  loaded  using  the  MDEP  is  static.  This  means  the  micropro¬ 
gram  can  not  be  changed  as  a  software  program  is  executing.  It  is  equi¬ 
valent  to  microcode  residing  in  a  PROM.  The  second  method  to  load  micro¬ 
code  into  WCS  is  to  use  the  WLOAD  software  utility  program  feature.  This 
utility  permits  the  WCS  to  be  loaded  from  a  calling  FORTRAN  or  ASSEMBLY 
program.  In  addition  it  permits  the  calling  program  to  dynamically  load 
the  WCS  while  it  is  executing.  The  dynamic  capabilities  of  using  this 
utility  program  was  checked  out  by  modifying  the  test  program  to  inter¬ 
actively  load  the  microprogram  into  WCS  using  the  WLOAD  feature.  Using 
the  WLOAD  feature  permitted  loading  the  WCS  interactively  trora  the  sys¬ 
tem  console  by  simply  specifying  the  file  name  containing  the  micro¬ 
program  whenever  the  test  program  was  run.  Once  the  microprogram 
was  loaded  into  WCS  using  this  technique  it  was  not  changed  during  the 
execution  of  the  test  program. 
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Effectiveness  of  the  MicroproR-amned  Bit  Reversal 


The  effectiveness  of  the  microprogram  bit  reversal  sorting  algorithm 
developed  in  this  study  was  verified  on  both  the  AFIT  21MX  2108  compu¬ 
ter  and  AFWAL' s  21MX  2112  computer  svstem. 

Two  measures  of  effectiveness  were  obtained.  First,  the  average 
run  time  of  the  FFT  programs  with  and  without  the  microprogram  was  mea¬ 
sured  and  the  relative  percent  improvement  in  speed  up  computed.  Sec¬ 
ond,  a  speed  up  factor  for  the  microprogram  itself  was  determined  by 
comparing  the  measured  average  run  time  of  the  FORTRAN  bit  reversal  soft¬ 
ware  with  the  theoretical  computed  run  time  for  the  microprogram.  The 
theoretical  computation  of  the  run  time  for  the  microprogram  is  given 
in  Appendix  K. 

Figure  5  and  Figure  6  show  the  comparative  average  baseline  run 
time  obtained  for  the  F0UR1  and  FOURE  programs  without  using  the  micro¬ 
program.  This  data  was  taken  on  the  AFIT,  EWAF  and  AFWAL  HP  21MX  com¬ 
puters. 

Figure  7  and  Figure  8  show  the  comparative  average  run  times  ob¬ 
tained  for  the  F0UR1  and  FOURE  programs  when  the  microprogram  is  called. 
Data  was  obtained  only  for  the  AFIT  and  AFWAL  computer.  The  EWAF  com¬ 
puter  does  not  have  a  microprogramming  capability  configured  into  the 
system.  The  AFWAL  computer  however  had  the  microprograramrble  writable 
control  store  module  installed  and  the  microprogram  support  software 
configured  into  their  RTE-IVB  opera  tin.-.;  system.  It  was  therefore  pos¬ 
sible  to  run  the  same  microprogram  on  the  AFWAL  computer  to  check  the 
results  obtained  on  the  AFIT  computer  and  also  verify  that  microprograms 
developed  at  AFIT  can  successfully  run  on  another  HP  21MX  computer  sys¬ 
tem  on  base. 
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Comparison  of  Run  Time  for  F0UR1  Program 


AVERAGE  RUN  TIME  FOR  FOURE  PROGRAM 


Figure  6  Comparison  of  Run  Time  for  FOURE  Program 


TABLE  V 


Relative  %  Improvement  Factor  over  Software  FFT  routines 


Number 

of 

AFIT  2108 

Computer 

AFWAL 

2112  Computer 

Points 

F0UR1 

FOURE 

FOUR1 

FOURE 

4 

38 

37 

- 

- 

8 

-6.97 

16 

- 

- 

16 

13.3 

6.4 

- 

- 

32 

4.45 

2.44 

- 

- 

64 

8.99 

1.13 

- 

- 

128 

-8.92 

11.98 

5.86 

2.49 

256 

5.11 

3.74 

5.73 

7.44 

512 

5.15 

4.85 

5.58 

6.48 

Comparison  of  Run  Time  for  F0UR1  Program 
with  Microprogramed  Bit  Reversal  Routine 


AVERAGE  RUN  TIME  FOR  FOURE  PROGRAM 


Figure  8  Comparison  of  Run  Time  for  FOURE  Program 

with  Microprogramed  Bit  Reversal  Routine 


Table  V  shows  the  relative  percent  improvement  in  average  run  time 
obtained  for  the  various  number  of  points  run  for  the  FOUR!  and  FOFRE 
programs  when  the  microprogrammed  bit  reversal  routine  is  used.  The 
table  shows  the  improvement  factor  varies  considerably  for  the  differ¬ 
ent  number  of  points  tested.  This  is  attributed  in  part  to  the  in¬ 
precision  in  the  method  used  to  time  the  routines  particularly  for  the 
cases  where  the  number  of  points  is  less  than  32  points. 

The  method  of  timing  was  accurate  only  to  about  10  milliseconds 
by  using  the  computers  system  clock.  No  conclusions  are  drawn  on  the 
timing  results  obtained  for  the  cases  having  less  than  32  points. 

One  discrepancy  is  apparent  in  table  V  however,  for  the  128  point 
case.  In  the  128  point  case  the  timing  should  be  relatively  stable  but 
the  results  showed  a  speed  up  did  not  occur  on  the  AFIT  computer  system. 
This  is  contrary  to  what  was  expected.  To  check  this  discrepancy  the 
same  program  was  run  on  the  AFWAL  computer  system.  The  timing  results 
obtained  indicated  a  5.86%  improvement  for  the  128  point  case.  This 
Improvement  was  in  line  with  what  was  expected  from  the  profile  analy¬ 
sis  of  the  FFT.  Likewise  the  factors  obtained  for  the  256  and  512 
point  cases  tend  to  stablize  around  5-6  percent  improvement  which  is 
roughly  that  expected  from  the  activity  analysis. 

An  attempt  was  also  made  to  measure  the  average  run  time  of  just 
the  FORTRAN  bit  reversal  code.  Figure  9  shows  the  results  measured 
and  compares  thorn  with  the  theoretical  calculated  run  time  for  the 
microcoded  bit  reversal  routine.  Figure  10  shows  the  speed  up  factor 
of  the  microprogram  over  the  equivalent  software  code.  From  Figure 
10  It  Is  clear  that  the  microprogram  is  roughly  9  to  10  times  faster 
than  the  software  routine. 


SPEED  UP  FACTOR 


Summary 


In  summary,  this  section  analyzes  two  similar  fast  Fourier 
transform  programs  F0UR1  and  FOURS  with  the  activity  profile  gen¬ 
erator  (ACTV)  program.  Analysis  of  the  resulting  activity  profiles 
reveal  microprogramming  the  bit  reversal  sorting  section  would  im-. 
prove  overall  performance  approximately  4%  to  17%.  A  limited  proto¬ 
type  microprogram  was  developed  for  performing  the  bit  reversal  sort¬ 
ing  algorithm.  The  resulting  performance  using  this  microprogramming 
was  verified  experimentally.  The  results  show  that  the  micropro¬ 
gramming  did  speed  up  the  bit  reversal  algorithm  itself  by  almost  a 
factor  of  10  but  improved  the  overall  response  of  the  FFT  programs 
only  by  about  5-6  percent. 
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VI.  RESULTS  AND  CONCLUSIONS 


Introduction 

This  section  summarizes  the  results  and  conclusions  arrived  at 
during  this  study.  This  study  is  divided  into  three  parts.  First, 
the  survey  of  electronic  warfare  programs  on  the  ASD  Electronic  War¬ 
fare  Analysis  Facility  (EWAF)  and  the  selection  criteria  and  analy¬ 
sis  used  to  select  a  program  for  possible  improvement  using  micro¬ 
programming  techniques.  Second,  the  implementation  phase  where  the 
AFIT  21MX  computer  was  assembled  and  a  RTE-III  system  generation  was 
done  to  implement  a  user  microprogramming  capability  on  the  system. 

And  third  an  actual  demonstration  involving  the  development  of  a 
microprogrammed  bit  reversal  sorting  algorithm  for  an  FFT  program  and 
resulting  tests  to  measure  the  relative  speed  gain  achieved  when  the 
FFT  program  is  run. 

• 

Results 

The  EW  program  survey  resulted  in  a  list  of  programs  used  for 
electronic  warfare  analysis  and  data  reduction  tasks.  Most  of  the 
programs  are  currently  running  on  the  EWAF  21MX  facility.  Ssome  of  the 
larger  simulation  models  however  run  on  the  base  CDC  computer  and  are 
being  considered  for  implementation  on  the  EWAF  computer  in  the  future. 
Likewise  a  number  of  digital  signal  processing  programs  (such  as  the 
fast  Fourier  transform  programs)  are  of  general  interest  to  the  EN 
community  and  would  be  useful  to  implement  in  real  time.  As  a  result 
of  this  survey  several  programs  were  examined  for  possible  improvement 
in  their  execution  speed.  An  activity  profile  of  some  of  the  programs 
showed  that  it  would  be  difficult  to  improve  eortain  nreerams  which 
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exhibit  high  activity  in  the  system  library  routines.  As  a  result  of 
this  analysis  it  was  decided  to  perform  a  more  detailed  profile  analy- 
sis  on  the  FFT  programs  available  and  to  demonstrate  the  microprogram¬ 
ming  concept  by  microcoding  portions  of  the  FFT  programs. 

It  was  found  from  the  detailed  profile  analysis  of  the  FFT  pro¬ 
grams  that  the  bit  reversal  sorting  took  approximately  3%  to  17%  of  the 
FFT's  time.  By  microprogramming  the  bit  reversal  portion  of  the  FFT  it 
was  demonstrated  a  speed  up  of  approximately  5-6%  resulted  for  the  en¬ 
tire  FFT  routine.  This  is  roughly  that  predicted  by  the  profile  analy¬ 
sis  of  the  FFT.  Further,  measurements  of  the  run  time  for  bit  reversal 
FORTRAN  software  compared  with  the  theoretical  microprogrammed  bit  re¬ 
versal  run  time  resulted  in  a  speed  up  of  9  to  10  times  over  the  same 
function  coded  in  FORTRMi. 

The  bit  reversal  algorithm  microprogrammed  in  this  case  demon¬ 
strates  several  important  points:  one,  it  shows  that  microcoding 
does  decrease  execution  time.  Two,  it  shows  also  that  what  is  micro¬ 
programmed  does  not  necessarily  improve  the  overall  response  of  a 
program.  Three,  the  overall  improvements  to  a  program  can  b6  reason¬ 
ably  predicted  through  analysis  using  an  activity  profile  generator. 
Four,  it  illustrates  the  complexity  and  attention  to  detail  the  user 
must  deal  with  when  using  microprogramming.  Five,  it  illustrates  the 
limitations  imposed  by  HP-21MX  M-series  computer  when  considering  the 
use  of  the  floating  point  routines  in  user  microprograms.  Six,  it 
shows  the  flexibility  offered  by  HP  microprogramming. 

Conclusions 

1.  The  study  objective  of  implementing  a  dynamic  user  micropro- 
•r  i-vr  .••••'  V  I  a  |  m  '  \"T'r  r  i . . 

fully  capable  of  supporting  user  microprogramming  research  as  was  demon- 
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strated  by  developing  and  executing  a  microprogrammed  bit  reversal 
algorithm  for  an  FFT  program. 

2.  An  activity  profile  scheme  has  been  shown  to  highlight  a 
programs  performance  from  which  a  user  can  easily  deduce  the  signifi¬ 
cance  microprogramming  will  have  on  performance. 

3.  Microprogramming  is  specialized  and  more  difficult  than 
assembly  language  programming  and  is  worth  learning  but  its  benefits 
are  higly  dependent  upon  the  programs  application.  It  can  provide  a 
powerful  tool  for  solving  time  critical  applications  but  at  the  same 
time  it  is  not  a  good  tool  for  everyday  general  applications. 

4.  It  was  demonstrated  that  AFIT  has  the  capability  to  support 
user  microprogramming  for  the  HP-21MX  computer  systems  and  could  pro¬ 
vide  a  significant  focal  point  for  HP  users  at  Wright-Patterson  through 
AFIT  research  on  project  involving  microprogramming. 

Recommendations  , 

Based  on  the  results  of  this  study  the  following  recommendations 
for  further  study  are  submitted: 

1.  Continue  investigating  the  capabilities  user  microprogramming 
offers  especially  for  increasing  throughput  in  a  multi-user  environment. 
One  area  observed  in  this  study  which  could  greatly  effect  throughput  in 
a  multi-user  environment  was  microprogramming  the  HP  RTE  system  library 
and  utility  programs.  Another  area  for  investigation  could  be  the  possi¬ 
bility  of  a  distributed  system.  AFIT  currently  has  two  21MX  computers 
and  is  considering  buying  the  F-series  21MX  computer.  One  computer 
could  be  used  to  monitor  the  multi-user  operating  environment  and  it 
could  pass  off  time  critical  programs  to  a  second  computer  for  faster 
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real  tine  response.  Microprogramming  could  be  used  here  not  only  to 
speed  response  but  also  to  taylor  the  interface  hardware  for  maximum 
utilization. 

2.  Expand  the  basic  understanding  and  procedures  outlined  here¬ 
in  and  investigate  desirable  algorithm  sets  which  will  optimize  the 
execution  of  programmed  applications.  For  example  various  algorithms 
which  perform  the  FFT  using  integer  operations  are  faster  than  float¬ 
ing  point  routines.  Investigate  the  drawbacks  and  possible  advantages 
of  user  dynamic  microprogramming  and  expand  this  to  include  the  effects 
of  multi-users  on  the  system.  A  related  area  of  study  is  the  use  of 
higher  order  language  compilers  to  directly  translate  programs  into 
microcode.  A  final  area  of  study  includes  development  of  better  soft¬ 
ware  support  tools.  HP  software  support  is  excellent  for  microprogram 
development  but  it  could  be  improved.  For  long  microprograms  it  would 
be  useful  to  develop  an  activity  profiler  to  monitor  microprograms. 

This  would  permit  studying  the  activity  within  microprograms  being  de¬ 
veloped. 

3.  Recommend  that  AFIT  upgrade  the  existing  21MX  M-series  com¬ 
puter  to  an  F-series  computer.  In  addition,  consideration  to  obtain¬ 
ing  a  9-track  tape  unit,  A  pen  plotter  and  graphics  printer  should  be 
pursued.  The  F-series  computer  is  a  much  more  powerful  computer  and 
has  the  floating  point  routines  implemented  in  hardware.  In  addition 
the  F-series  can  use  more  WCS  than  the  M-series  and  is  therefore  much 
more  flexible  and  tailorable.  The  microprogramming  capability  has 
been  improved  over  the  M-series  and  allows  for  multiple  microsubrou¬ 
tine  jumps  which  limited  the  M-sertes.  A  magnetic  tape  unit  is  neces¬ 
sary  to  interface  with  various  labs  and  the  CDC  computer  system  on 


scored  on  tapes  from  other 


base.  For  example,  data  acquisition  results 
facilities  can  easily  be  processed.  The  plotter  and  graphics  printer 
are  required  to  put  computed  results  into  useful  form  for  analysis  and 
for  real  time  monitoring  t. sks. 
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APPENDIX  A 


List  of  Programs  Surveyed 

The  following  is  a  list  of  the  post-data  processing,  digital  signal  pro¬ 
cessing,  and  electronic  warfare  analysis  programs  used  by  the  Electronic 
Warfare  Division,  ASD/EMAD,  Wright-Patterson  AFB. 

Description  Source 


Data  Transformation  and  Mathematical 


Programs 


1. 

Transformation  Program  (Like  the  BMD 

Transgeneration) 

&TRANF 

2. 

Matrix  Arithmetic  Program  (Add,  Subtract, 

Multiply  up  to  20  x  20  matrices) 

&MRITH 

3. 

Simultaneous  Equation  Solver  (Up  to  22 

Equations,  22  unknowns) 

&SIMEQ 

General  Statistics  Programs 

4. 

General  Statistics  Information  Program 

(Elementary  calculation,  point  estimates, 
mean  stan''.rd  deviation,  a  histogram) 

& STATS 

5. 

General  Statistics  for  Multiple  Croups 

(Same  as  STATS  but  many  groups  at  same  time) 

&MULTI 

6. 

Paired  T-Test  Program  (Student 's-T  for  paired 
observations) 

&PAIKT 

7. 

Test  of  Hypothesis 

(Test  Ho:  u  =  p0  or  ,!o:  Mi  =  M2  ) 

ATTEST 

8. 

Moving  Averages  in  A  Time  Series 

&M0VAV 

9. 

Cross  Tabulation  Program  (Cross  Tabulation 

on  Two  Single  Dimensional  Fixed  Point  Arrays) 

&CRTAB 

10. 

Discriminant  Analysis  Program 

&DSCRA 

11. 

Sample  Sice  Determination  on  the  Simple  Variance 

&SAMPL 

(Estimates  sample  sice  from  :j  -  and  degrees  of 
f  roodo-vA 
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Descript  ion 

Source 

12. 

Confidence  Interval  for  Mean  and  Variance 
Normal  Distribution 

of  a 

& COMF1 

13. 

Cumulative  Distribution  Program 

&CUMDS 

14. 

Test  of  Hypothesis  for  Variance  (Tost  Ho: 
or  Ho:af  =*  ai  ) 

2  2 
o  =  a0 

&VHT0H 

Specialized  Statistic  Programs 

15.  X-4  Plotter  on  Printer  (Plots  on  a  standard  &XYPLO 

Printing  device  Not  Plotter) 

16.  Histogram  Plotting  Program  (Plots  on  a  standard  &HSTPL 

Printer  not  a  Plotter) 

17.  Bartlett's  Homegeniety  of  Variance  Test  &BARTL 

18.  Duncans  Multiple  Range  Test  &DUNCN 

19.  Wilcoxon-Mann-Whiting  Test  &WILMA 

20.  Kendall's  Coefficient  of  Concordnace  &KENDW 

(Checks  for  Ties) 

21.  Kendall's  Coefficient  of  Concordance  &KENDL 

(No  check  for  ties) 

22.  Kendall's  Tau  Correlation  &KETAU 


Cross  Correlation  Programs 

23.  Cross  Correlation  Program  &CR0SC0 

24.  Multiple  Correlation  Program  (row  Sums  of  Squares  &C0RRL 

Cross  Product  Matrix,  etc) 

25.  Multiple  Correlation  Matrix  (Pearson  Correlation  &C0RMA 

Coefficient) 


26. 

Auto  Correlation  and  Snectr 

’.1  Analysis 

SAi'TSP 

Regression 

Analysis  Program 

27. 

Least  Squares  Regression 

&LSQRG 

28. 

Linear  Regression  With  Rei'l 

l  it  i on 

4 1.  IMRE 

go , 

Bi  ;  i \ 

Or  i  >  I  *  )A 
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Source 


Description 

30.  Linear  Regression  Confidence  Interval  Estimates 

(Least  Squares) 

31.  Polynomial  Regression  (Up  to  13th  degree 

Polynomial) 

32.  Polynomial  Regression  with  Confidence  Intervals 

33.  Orthogonal  Polynomial  Regression 

34.  Non-Linear  Regression  Program  (Least  Squares 

Regression) 

35.  Non-Linear  Regression  of  an  Arbitrary  Function 

36.  Non-Linear  Regression  of  a  Single  Variable 

Function 

37.  Stepwise  Regression 

38.  Multiple  Regression 

39.  Pooling  of  Groups  in  Regression 

Test  of  Fit  Programs 

40.  Chi-Square  Goodness  of  Fit  Test  . 

(For  normal  Distribution) 

41.  Kolraogorov-Smirnov  Test  of  Fit 

The  ANOVA  Programs 

42.  Completely  Randomized  Design 

43.  Completely  Randomized  Design  with  Subsanpling 

44.  Randomized  Complete  Block  Diagram 

45.  Randomized  Complete  Block  Design  with  Subsimpling 

46.  Two  Way  Factorial  Design 

47.  Three  Way  Factorial  Design 

48.  Analysis  of  Variance  Info  Generator 

Subprograms  in  Sr.it  Pack 

♦  '  t  •  ' !  i  L  >  ;  . .  v L  L  .  i  :  •  L  ' »  ’  a  .i  i 1  r  >  *  • » w  *  i  V  —  « ••  .  *  l  L*  L .  l  uUl 

s  Subroutine) 


&LSRCI 

&LSRCI 

&PLYCI 

&0P0LY 

&NLSQR 

&0MEGA 

&S0MEG 

&STEPR 

&MULRE 

&POOL 

&XIFIT 

&RST0F 

&ANOV1 

&AN0V2 

&AN0V3 

&AN0V4 

&AN0V5 

&x\N0V6 

&AN0V7 
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Source 


Description 

50.  Simpson  Integration  Subroutine 

51.  Matrix  Inversion  Subroutine 

Contains: 

SYMIV  -  Symetric  Matrix  Inversion 

PINIV  -  Maximum  Pivoltal  Element  Inversion 

QMINV  -  Short  Cut  Matrix  Inversion 

SPMAT  -  Check  for  Significance  of  Pivotal  Elements 
Before  Inversion 

MATIV  -  Matrix  Inversion  with  Simultaneous 
Equation  Solver 

52.  Histogram  Plotting  Subroutine 

(Like  HTSPL  but  Subroutine) 

53.  Simultaneous  Equation  Solver  Subroutine 

(Like  SIMEQ  only  Subroutine) 

54.  Probability  Subprograms 

Contains : 

ANORM  -  Normal  Distribution  Function 

BINOM  -  Binomial  Distribution  Function 

POIS  -  Poisson  Distribution  Function 

FPROB  -  F  Cummulative  Probability  Function 

CHIFN  -  Chi-Square  Cummulative  Probability  Function 

CHISQ  -  Chi  Square  Subroutine 

FDIST  -  Inverse  F  Distirbution  Function 

TDIST  -  Student's  X  Distribution  Subroutine 

XARF.A  -  Normal  Probability  Function 

55.  Time  Series  Plotting  Function 

(On  Printer  Not  Plotter) 

56.  Normal  Distribution 

(Assembler  Language  Subroutine) 


&INTGR 

&MATIV 


&HPLSB 

&SMSUB 

&PROS'J 


&SPLTR 

4 PRO BN 
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Peso 

57. 

58. 

59. 

60. 

61. 

62. 

63. 

64. 

65. 

66. 

67. 

68. 

69. 

70. 

71. 

72. 

73. 

74. 

75. 

76. 


ription  Source 

Variance  Ratio  Distribution  Function  &PR0BF 


Electronic  Warfare  Analysis  and  Simulation  Models 

Programs  used  in  Analysis  of  down  link  jamming  &D0ALL 

Model  Simulates  the  intercept  of  one  or  more  GEMM 

targets  by  an  Air  To  Air,  Surface  to  Air, 

Air  to  Surface  or  Surface  to  Surface  Missile 


A  missile,  targe,  and  radar  simulation  —  Simulates  BETA 
Blast-Frag  warheads. 

Model  for  AWACS  jamming  radar  and  expendable  jammers  EXPAND 


Model  calculates  propagation  loss  and  profile  of  COMTE 

transmitters  and  antenna  in  jungle  conditions 

Models  ideal  three  channel  monopulse  radar  SPACE 

Models  IR  missiles  and  simulates  ideal  tracking  DECOYAN 

Models  IR  missile  simulation  HOME 

Models  IR  missile  simulation  ATEM 


Models  surface  to  air  missile 


TWS4 


Models  radar  range  and  jammer  equations  for  multi-  Expendable  jammer 

jammer  sources  -  computes  jamming  to  signal  model 

(J/S)  ratios 


Models  effect  of  chaff 


SCARE 


Generates  multi  aircraft  flight  path  scenarios 

Terrain/Clutter  model  for  MPASS3 

Multi-element  radar/target/ jammer  simulation 
program 

A  command  and  control  campaign  model. 

Model  to  compute  harmonics  and  intermodulation 
Products 


MPAS SI 
MPAS S 2 
MPAS S3 


MECCA 

Receiver  model 


Model  for  general  theory  of  diffraction  GTD 

Model  computes  bes.se  1  functions  &RESDV 
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Descr in  t ion 


Source 


77.  NASA  TWT  program-simulates  TWT  response  TWT  program 

78.  Various  antenna  data  bank  and  data  reduction 

programs  to  catalog,  format  and  interpolate 
RADC  antenna  measurement  data 

79.  Marcum  Swerling  models 
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APPENDIX  B 


A  FORTRAN  program  named  ACT\'  was  used  in  this  study  to  generate 
the  activity  profiles  for  analyzing  where  to  apply  microprogramming. 

A  listing  of  the  ACTV  program  is  given  below.  The  program  was  writ¬ 
ten  to  work  with  HP's  RTE-IVB  operating  system.  The  ACTV  program 
basically  monitors  the  execution  of  an  application  program  by  period¬ 
ically  interrupting  the  application  program  and  storing  the  contents 
of  the  program  address  register  in  52  address  cells.  The  rate  of  in¬ 
terruption  is  variable  in  integer  increments  of  10  milliseconds.  The 
program  allows  the  user  to  interactively  specify  the  address  range  to 
be  monitored  and  permits  the  user  to  zoom  in  on  a  specific  address 
range  of  interest.  The  ACTV  prints  out  a  histogram  and  cummulative 
distribution  curve  of  the  activity  monitored. 

Appendix  H  contains  a  brief  discussion  on  how  to  run  ACTV. 


PR  CGr;  AH  AC  TV 

DIMENSION  FBF  (  5? )  ,  I  PR  k  Z2)  ,  IN(3> 

C  A C T I V I  T  Y  P ft 0 F  r L H  G £ A E RAT C f<  l ! S I N G  S USP E N D  A D D fi E S S 

C  FROM  ID-SEGMENT 

L,  b(  Jill  LEONARD 

0  1J  j  A  P  ri  V  i.  0  f  'i  I.  C  v  I ..  A  t’  j  IaI  I '  „ '  ».■ 

c 


b 
5  li 


WR ITE< i  ,  1  0  ) 

FOR HAT ( "  ACTIVITY  PROFILE  GENERATOR  ">//, 
i  “  TYPE  PROG  NAME  "  ; 


IN ( i >  =2H 


IN  ( 2  )  “2H 
IN  ( 3 )  --2H 
Rfc.AI.i  (  I  ,  20  )  IN 

20  FOR HA  f  <3A2) 

C  r.FT  ADDRESS  OF  ID  SEGMENT 

C  D  3  E  0  - 1 D  u  E  T  (In  ) 

I F ( I DSEG . NE . 0 ) GOTO  i 0  0 
WRITE U  ,  30  )  1  DIVES 

30  FORMAT!  "IMPROPER  PROGRAM  NAnF ,  Iu3FG= 

GOTO  Z 

100  WR I T  E  ( 1 , 1 1 0  ) 

iiO  FORMAT ( “TYPE  BOUNDS  OF  ACTIVITY  PROFILE,  LOWER-UPPER 
i  “  XXXXX  XXXXX  X") 


NT-0 


R 1-  A b  (1,1 2 0  )  LL,  f. li ,  N  F 
1 2 0  F 0 R h AT ( 2 K 6  ,  It,') 

IF (NT . LE . 0 )N i~3 

C  INITIALIZE  PROFILE  BUFFER 

DO  130  1*1,52 
FBF ( 1 )-0 . 

1.30  CONTINUE 

ID-.!  U  -  i  L  +  1. 

I NCR ~i iU-lL+1 )/S0 
I F  ( 1  h C R  s' 5  0  . 1  T  .  ID)  I N C  ■< =•  I N C R  ;  i 
(V  ).  —  I  !'.::>F G+ 1.  S 
,i.w  •  ! 

C  IF  f'Riii'.KAri  IS  NOT  C>j>RFNTI..Y  ACTIVE  DON'T  RECORD  l.  OCA  I  ION 

3  0  0  C  A  L  L  R  C  0  K  E  (  I W  l  ,  I V  A  L  ) 

IF  (I  AN  Of  I  VAl .  ,  .1  SB  >  .  NC  .  t  Ji.OTO  2  Pi! 

C  READ  GuS/' ENDED  LOCATION 

CALI.  RCURL(  IW2, 1  UAL) 

C  CHECK  FOR  BEFORE  BOUNDS 

P  1>I  k  U  -f  t.F  <  I.  J  ♦  t  . 


/•J  t  •  J 


“I- 
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r.  CHFCK  FOR  BEYOND  BOUNDS 

140  IFdVAL.LE.  J.U)GOTQ  ISO 

FBF ( 52) -FBF ( 52  > +  1 . 

GO  i  0  <20-0 

C  MARK  INTERVAL 

ISO  IVAL=(  TVAL-IL)  /INCR+2 
FBF  ( I V i-j L  >  =FBF  ( I VAL  )  M  . 

C  TERMINATE  MONITORING  IF  OPERATOR  BREAKS 

0  Cl  0  I F  <  i F fc RK ( IDMY ) ) SO  0 , 2 1 0 
210  ISf:=0 

C  WAIT  DESIRED  INTERVAL 

CALL  EXEC <12,150,1 ,0,-NT) 

GOTO  300 

SOO  UR  ITT (S, Si  0 ) IN  > IL , IU , INCR 

S.10  FORMAT ('} PROGRAM  ACTIVITY  PROFILE  FOR  " ,3A2,/, 

1  "  FROM"  ,KS,  "  TO"  ,K8,"  IN  INCREMENTS  OF", 10) 

SIS  FORMAT  <  "  INTFRVAI.  NO.  FROM  TO  NO  OF  HITS 
1  ,  “N  u  I'  mAlIZED  HI  T  S  NO  R  M  A I .  A  C  C  liM  "  ) 

C  FIND  >"i  A  X  VALUE  OF  HISTOGRAM 

FMa  =  ~  i 
TSUM-0 . 

DO  S20  I - 2 , S t 
TSUM-TSUm  lFDF  ( I ) 

I F (FMX  IT  FBF ( I ) ) FMX-FBF ( I ) 

520  CONTINUE 

C  EXIT  IF  NO  ACTIVITY  IN  DESIRED  RANGE 

IF < FMX . GT  0 . ) GOTO  60  0 
IF  < ( FBF ( 1 ) +FBF (S2) )  GT.O.)GLTO  S40 
WRITEd  .,530) 

S30  FORMAT <  "Mu  PROGRAM  ACTIVITY  RECORDED  —  AT  AI...LM!") 

WRITE (6,530) 

STOP 

S40  WRITE < i ,5S0)FBF< i ) , FBF (52) 

SSQ  FORMAT ( “  NO. PROGRAM  ACTIVITY  IN  REGION  OF  INTEREST" ,/ 
l  , "BEFORE®" ,E13 .  7, "  AFTER*  "E 13 . 7 ) 

WR I T E  ( 6 ,  S S  0  )  F  h F  <  1 )  ,  F  BF  <  S2 ) 

C  WRITE  TABLE  OF  ACTIVITY  PROFILE 

600  WR  ITF (6 ,SiS ) 

SUM-0 . 

TSM1=TSUM+F BF ( 1 ) *FBF ( S2 ) 

D  0  to  '.>  0  i  1  .  '.->  2 
6  Ufi-::  ■>  U  M  ♦  F  i  i  F  «,  T  )  /  T  3  ii  1 
(•  NORM-r  BF  ( I )  /FMX 
IFF  - I l  < LNCR  * ( I  -  2) 

1TO--I F  R  +  INCR 

IF ( I . EQ . 1 >IFR =0 

IF  <  I.  EGI  .S2)  110*32767 

WR  ITF  ( ,  6  V  0  )  r  ,  TER  ,  I  rn,::  O'  l  > 
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610  FORMAT <4X, 13, 6X,2K7,F1G . 0 , F 1 7  8 , FlS  5 ) 

oSO  C' * 1  * ! T  j- fi i  i T- 

C  PLOT  HISTOGRAM  ON  PRINTER 

yi? ITT.  ( t, ,  5 \  0  >  T  N ,  II... ,  T. 0 , 1 NCR 

70C  FORMAT ( “  INTERVAL  024$ 
1  S  "  > 

C  FOR  EACH  DATA  INTERVAL 

! ;  Un  =  -  F  B F  v  i. )  /  “  5 1 1 .1 
DO  SUB  J=l>52 
C  CL  FA"  PRINTER  BfJEFFR 

DO  7 10  1=1,51 
IP  R  ( I ) =2H 
710  CONI  I NOE 

c  calculate:  indexs 

f.c.M  im  t-rpc  >;  :r  >  ,•  i  st ;m 

1NDX*SUM#50  .  *-1  > 

IF ( ( J . NL . 1 ) . AND . a . NF . 52) ) IPR < INDX >=2HII 
I  NORM  ••-SO  .  *FBF<  T  >/FMX+i  5 
C  PRINT  AN  X  If  Off  PLui 

i:  F  ( 1 N  0  M  ft.  1.7  1>T  NOR  r,  -  - 1 

If  VxlVf.iRil.Gi  .  5  i.  i  1  iV j, 

0  PRINT  AN  *  IF  ON  THE  PLOT 

IF  (  I  NOR  Pi .  GT  .  0  )  I  PR  ( I  NORM )  =Z)i  10  0 
IF  <  I  NORM  .  i...  T  .  0  >  I  PR  <  -  i  rf  '?’.M  <  LHXX 
UR  IIP  it* ,  720  )  J  >  i  £PR  (U  )  ,K»1  , 5 1 ) 

720  r  O  R  N  AT  i  2  X ,  I  fa ,  3  X ,  5  i.  A 1 ) 

300  CONTINUE 
STOP 

r-  ’  P; 

En  D  :>> 


f 

I 

I 

.!: 

i, ; 
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APPENDIX  C 


The  following  is  a  list  of  on-base  facilities  using  HP  computer 
equipment  and  personnel  surveyed  for  this  project. 


1. 

Bill  Griffin 

ASD/ENAD 

54061/52789 

Bldg  125 

2. 

Jim  Leonard 

AFWAL/AARF-2 

53050 

Bldg  23 

3. 

Bob  Ballard 

AFWAL/FIMN 

52593 

Bldg  450 

4. 

Bryan  Kent 

AFWAL/AAWP 

55076 

Bldg  821 

5. 

Blenn  Williams 

AFWAL/FIMN 

52493 

Bldg  26/240 

6. 

Jim  Mosora 

AFWAL/MLPO 

53808 

Bldg  651 

7. 

Larry  Linder 

AFFDL 

55205/56795 

8. 

Dr.  Gary  Lamont 

AFIT 

53057 

Bldg  620 
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APPENDIX  D 


"  <3«S» 

The  HP-21  M-Serles  User  Microprogrammable  Computer 

Introduction 

.The  purpose  of  this  section  is  to  review  the  structure  of  th£ 

HP  21MX  M-Series  Computer  and  to  describe  the  HP  software  support 
tools  available  for  microprogram  development. 

21MX  Computer  Description 

The  HP-21MX  computer  is  a  16  bit,  user  raicroprogrammable  general 
purpose  minicomputer.  Figure  11  shows  a  functional  block  diagram  of 
the  computers  architecture.  A  good  understanding  of  the  architecture 
is  a  prerequisite  to  understanding  and  applying  microprogramming  effec¬ 
tively.  The  computer  architecture  is  typical  of  most  computers  and  can 
be  broken  down  into  the  five  general  sections  shown  in  Figure  11.  The 
sections  shown  on  Figure  11  are  the  arithmetic  and  logic  section,  com¬ 
puter  control  section,  main  memory  section,  fror*-.  panel  section  and  the 
input/output  section. 

Also  shown  on  the  diagram  are  the  three  main  16  bit  data  paths 
for  the  computer,  namely  the  S-bus,  T-bus  and  I/O  bus.  It  can  be  seen 
that  the  various  functional  sections  are  conne  .ted  via  these  buses. 

The  S-bus  is  the  central  data  transfer  path.  The  S-bus  can  accept 
data  from  all  registers  except  the  L-register,  SAVE  register,  RAR, 
Extend  register  and  the  Overflow  registers.  The  Extend  and  Overflow 
registers  are  not  shown  on  the  diagram  however.  Any  data  placed  on  the 
S-bus  is  available  to  any  of  the  following  registers: 

M-register 
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FRONT  PANEL  SECTION 


CIR 


I/O  SECTION 


L-regtster 

CNTR  (Counter  register) 

Display  Register 
Display  Indicator 
Instruction  Register  (IR) 

The  T-bus  is  a  one  way  data  bus.  Its  purpose  is  to  receive  data  from 
the  Rotate/Shifter  (R/S)  and  selectively  pass  this  data  to  any  of  the 
scratch  registers  (SI  through  S12),  the  A  or  B  registers,  or  the  X,  Y, 

P  and  S  registers.  The  X  and  Y  registers  are  often  used  as  index 
registers.  The  P-register  is  used  as  the  location  address  counter. 

The  S-register  is  used  for  the  front  panel  switch  registers.  The  I/O 
bus  permits  transfering  data  to  and  from  peripheral  devices  under  pro¬ 
grammed  control.  For  example  the  graphics  terminal  or  printer  devices. 

The  I/O  bus  interfaces  directly  with  the  S-bus. 

Arithmetic  and  Logic  Section 

The  arithmetic  and  logic  section  performs  all  arithmetic  and  func¬ 
tional  modifications  on  the  data  for  the  computer.  As  pictured  in  Fig¬ 
ure  11  this  section  is  composed  of  three  main  units:  the  arithmetic 
and  logic  unit  (ALU),  the  Rotate/Shifter  (R/S)  and  22  local  registers. 

In  addition,  six  flag  registers  are  available  (used  to  indicate  special 
conditions)  which  are  not  shown  on  Figure  11.  The  ALU  is  designed  to 
receive  16  bit  data  from  the  S-hus  and  from  the  L-reglster  (latch  re¬ 
gister)  and  pass  the  resulting  computation  serially  to  the  Rotate/Shif¬ 
ter  (R/S)  unit.  A  wide  variety  of  shift  operations  can  be  performed  on 
the  data  that  leaves  the  ALU  unit.  The  R/S  puts  the  results  onto  the 
T-bus  for  assignment  to  one  or  more  of  the  22  local  registers.  The  ALU 
op, -r.  it:  i,  ii'i  include  ti:o  followin': 

• 
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1. 


ADD 


6. 


EXCLUSIVE  OR 


2. 

SUBTRACT 

7. 

NASD 

3. 

MULTIPLY  STEP 

8. 

NOR 

4. 

DIVIDE  STEP 

9. 

INCREMENT 

BY 

ONE 

5. 

COMPLEMENT  (NOT) 

10. 

DECREMENT 

BY 

ONE 

Computer  Control  Section 

The  computer  control  section  provides  control  over  all  HP-21MX 
computer  operations  and  data  transfers  by  means  of  a  microprogram 
stored  in  control  memory,  either  ROM  or  Writable  Control  Store  (WCS). 
Included  in  the  control  section  is  the  16  bit  Instruction  Register  (IR) , 
a  12  bit  SAVE  register,  12  bit  ROM  address  register  (RAR) ,  24  bit  by 
4096  word  control  store  memory  and  the  24  bit  ROM  instruction  regis¬ 
ter  (RIR) . 

Control  Store 

The  control  store  provides  for  microprogram  storage  in  both  non¬ 
volatile  ROM  and  Writable  Control  Store  (WCS)  modules.  The  control 
store  for  the  HP-21MX  is  configured  into  16  modules  (numbered  0  through 
15)  of  256  words  each.  A  12  bit  address  word  is  used  for  addressing 
control  store.  This  means  a  total  of  4096  words  of  addressable  mem¬ 
ory  is  available  for  storing  microinstructions.  Modules  0,  1  are 
reserved  for  holding  the  HP  basic  instruction  set.  Under  normal 
circumstances  those  locations  are  not  available  for  user  microprograms. 
Generally  modules  12  and  13  are  assigned  for  user  microprograms.  The 
writable  control  store  boards  are  usually  assigned  to  module  12  and  13. 
All  other  modulo  space  fs  reserved  for  HP  options.  However  if  the  HP  op¬ 
tions  are  not  instilled  in  the  computer  the  user  mnv  use  these  module 
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locations  by  installing  ran  re  writable  control  store  boards  or  user 
burned  PROMs.  The  control  store  word  length  is  24  bits  wide  as 
apposed  to  tire  16  bit  long  words  used  for  main  memory  instructions. 
When  installed,  modules  14  and  15,  contain  the  microinstructions  that 
interpret  the  floating  point  instructions  and  extended  instruction 
group  provided  by  HP. 

Main  Memory  Section 

The  main  program  memory  provides  for  storage  of  data  and  higher 
order  languages  like  FORTRAN,  ALGOL,  etc.  Included  in  this  section 
are  the  M-register  and  the  T-register.  The  normal  machine  instruc¬ 
tion  stored  in  Main  Memory  is  16  bits  long.  The  M-register  is  equi¬ 
valent  to  the  Memory  Address  Register.  The  M-  register  holds  a  15  bit 
address.  This  means  up  to  32K  of  main  memory  can  be  directly  access¬ 
ed  through  the  M-register  address.  The  M-register  is  loaded  with  an 
address  from  the  S-bus.  The  T-register  is  a  16  bit  buffer  register 
connected  to  main  memory  and  the  S-bus.  Main  memory  data  words  are 
transfered  to  and  from  the  S-bus  through  the  T-register. 

Front  Panel  Section 

The  front  panel  section  serves  as  a  basic  interface  between  the 
operator  and  the  computer.  Included  in  this  section  are  two  registers 
the  display  register  and  the  display  indicator.  Both  registers  are 
controlled  by  the  base  microinstruction  routines.  the  display  regis¬ 
ter  is  a  set  of  six  indicators  which  display  the  contents  of  the  A,  B, 
M,  T,  P  or  S  registers. 

Input / Oat  put  sect  ion 

111-*  i  »'■'!»  I  /*•*:»  :  *’«  •  r  . . >;*••**  !  «' 


interface  with  external  devices.  Included  in  the  I/O  section  Is  the 


A0-A11S  b64  AIR  FORCE  INST  OF  TECH  MRIGHT-PATTERSON  AFB  OH  SCHOO— ETC  F/G  9/2 
MICROPROGRAMMING!  A  TOOL  TO  IMPROVE  PROGRAM  PERFORMANCE. (U) 

OEC  81  J  J  STEIDLE 

UNCLASSIFIED  AFIT/GE/EE/810-56  NL 


central  interrupt  register  (CIR),  Interrupt  contr.  I  (IAK),  and  I/O 
control  and  select  logic  decoding.  The  CIR  is  a  6  bit  register.  It 
contains  the  select  code  (address)  of  the  interrupting  device  after 
and  interrupt  is  recognized. 

Software  Support  Tools  for  HP  Microprogram  Development 

Microassembler:  X  MICRO 

The  microassembler  is  a  Hewlett-Packard  software  utility  program 
that  processes  Mneumonic  microprograms  and  produces  the  binary  24  Bit 
word  patterns  (object  program)  that  are  to  be  loaded  into  WCS  for  con¬ 
trolling  the  computer. 

Micro-Debug  Editor:  %  MDEP 

The  Micro  Debug  Editor  software  allows  the  user  to  load,  debug, 
and  execute  the  object  microprograms  output  from  the  microassembler 
into  a  Writable  Control  Store  module. 

Writable  Control  Software:  %DVR36  &  %WLOAD 

%DVR36  is  the  driver  software  for  the  Writable  Control  Store. 

T’ais  software  takes  care  of  the  data  transfers  through  the  I/O  section 
and  maintains  conformity  with  the  RTE  operating  system. 

The  WLOAD  software  permits  loading  the  WCS  with  microprograms 
from  a  disk  file  or  I.U  device. 
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The  following  listing  shows  the  annotated  contents  of  the  answer  file 

used  in  this  study  to  generate  the  RTE-III  system  to  support  microprogramming. 


YES 
'I  0 

S  A  2  3  6  1  ,  ,  10. 

OB 

12 

0  0  0,  0 ,  0,2 ,  0 , 22 
256, 0,2,2,  0,S 
53S ,  132.2.2.  0,  22 
/£ 

4S 

1 

NO 
1  1 
0 

NO 
YE 
YE 
50 
1  12 
0 

MAP  MODULES 
LINKS  IN  CURRENT 

* 

«  RTE-III  SYSTEM  MODULES 
-* 

EEL,  ';CE33 V,  ,  JO, 

EEL ,  T.’SYL  IS ,  ,  10, 

EFL.LCUe,  ,10, 

EEL, ERL  181 ,,10, 

EEL , LRL I B2 , ,10, 

REL./LKMLIB,  ,  10, 

BEL,  LTCMDS,  ,10, 

K'tr.L ,  .-.BNPG1  ,,10, 

EEL,  L’lSMPGO,  ,10, 

KEL,  ’.BMPG3,  ,10, 

EEL, NDt-XAR, , 10, 

EEL,  LI  NET,  .  1  0, 

EEL , LMTM, , 10, 

EEL ,  L'tEVMP ,  ,10, 

EEL, LEF4 . N, ,10, 

PEl_,  LDVROO,  ,10, 

KEL,NDVR3<S,  ,10, 

EEL,  L3UP43,  ,10, 

l-'t.l.  ,:MDY0-3,  ,  111, 

EEL, LDVR32, ,10, 

!  (  .  : !  i  T I  iR  ,  111 

i  -.i  ,  i  ,  1  I*, 

EEL,  KEC)  I  TE:,  ,10, 


♦-ECHO 

k  * TRACKS  IN  OUTPUT  FILE 
-*•  OUT  PUT  SYSTEM  FILE  NhME 

*  DISC  MODEL- 

*  D I SC  SELECT  CODE 

*  SUBCHANNEL  0 

*  SUBCHANNEL  I 
k  SUBCHANNEL  2 

*  TERMINATE  SUBCHANNEL  DEF 
k  it  120  NO fti>  SEC TCORS/TRI; 

*  SYSTEM  SUBCHANNEL 

*  AU.Y  DISC 

*  TriG 

k  PR IV  INTERRUPT 

*  PR  IV  DIR  VERS  ACCESS  COMM 

*  FG  CORE  LOCK 

*  fg  cure  lock 

*  SWAP  DELAY 

*  MEM  SIZE 

k  BOOT  FILE 

*  PEL 0 ACTE  MODULES  BY  NAME 
k  CURRENT  PAGE  LINKING 


•k  MEMORY  RESIDENT  SYSTEM 

*  RTE  SYSTEM  LIBRARY 

*  RTF  COMPILER  LIBRARY 

k  RTE  DOS  LIBRARY  PARTS  15', 2 
k  R  TE/DOs  L I  URaR’V  I  ART  2 
**  RTF  BA  f CM  L  i BI-:aRY 

*  RTE-III  COMMAND  PROGRAM 

*  RTE  L-:i»fCH  MON1  DO  ft  PROGRAM  PORT  1 

*  RTE  BATCH  MON I  TOR  PROGRAM  PaRT 

*  RTE  BATCH  MONITOR  PROCRhM  PART 
♦*  RTE  DECIMAL  STRING  ARITHMETIC 

*  RTF  III  LOaPR 

*  RTE  MULT i -TERM INAL  MONITOR 
K  TP  VHP 

*  FORTRAN  IV  FORMA  TEE 

*  RTE  T I Y/PUNCH/PHOT 0  READER  DVR 
=K  RTE  WCS  DRIVER 
ik  POWER  FAILURE  DRIVER 

*  a>i  DRIVER 
IK  r 9 06  DISK  DRIVER  * 

-  r  sft  f 

.  <- 

k  EDITOR 


Ktu ,  ,  .11.1,  *  CRGCS-fcEf't-ReNi.  t 

Pt.L ,  .,iu,  *•  MSSErusL£K 

RFL  ,  LbWTC  H  ,  ,  10,  ►  SMI  TCH  GP£F.A  f  INC  SYSTEMS 

KLL,  .".Lu,  .10, 

r  c!i- . r . ,  i  o, 

PEL,  -WRONG ,  ,  I  0  , 

Kiri. .  r.Mif  rg  to  ’  !*■  re  n  t  c  s*OhSS  f  ksls*  f. 

PEl,  .‘.’n.'.KtiR  ,  ,  t  0  ,  t-  R  .  £  i'i  1 1" Ru  CRObb  ^SSSMSlSk 

ft£L,  .LWLOPD,  ,  t  0,  ■*  PTE  rj  l  CKO  LO,;l-  UT  I L  IT  Y  .ROUT  IN 

/  'h 


D  .  K  I  K  ,  1  ,  i 
*>CMo ,5,1 
MhL  r,  i  ,  1 ,3 
PPOtiG,  1 , 32  ~E7 
Hbhb  .  3 
.Kfcfc  ,  3 
L  0*4  DR  ,  3 
EL'  I  T r<  ,  3 
Pi  f  ’T Oh  ,  5  ,  1 
PR  hi-'  f  ,  3 
R-f  PN  r  ,  5 ,  5  (■< 

/£ 

MACRO'S 
.  hi- Y  ,  RP  ,  1  00 

.  0 1  V  .  KP,  1  0 0-!  00 
.  £>L  i  >  .  RP,  1  04  2  0  0 


.  Ob  1  , 

RP 

,  1 

04 

4  0  0 

*-  FLO 

>mT 

Mm2 

PCS’ 

.  F  rtL>  . 

,  RP 

,  1 

03 

0  0  0 

.  F  , 

.  RP 

,  1 

uS 

02  o 

.  FHP  , 

,  F  P 

.  ) 

05 

04  0 

.  I-  L’V , 

.  PP 

,  i 

0’-* 

OoU 

I F  I.:, 

,  RP 

.  1 

ic« 

1  0  0 

Fi.Oh 

r ,  r 

P  , 

i 

3  l  2 

.  hVw. 

.  RP 

,  i 

03 

l  »  t' 

Z  MX<L  ,  RP  ,  3 

/t 

1  b 

i  0  0 
4  0 
1.1 

1 23 ,512 


4.  RnKHl1fc.V0.Kb 


■■* TERMING  re 


*  f?  fciLOi  li  i .  SfcGi 't-.fi  FR 

»  }.hJJK  .  1  i.1  .r.  >  jii ...  i »  I  b 

•*  nh.y.  4  PhPTI  riL'i ib 

*  i-Ofi  8i-  L  i ill  nut 

4  4  OF  TO  V i.Hbb£b 
«  »  i_W  IIhPP  I  1  I  Li 

••*  #  kN 

i.  SUFFER  L  I  PIT  TS<  LOW  -  HIGH) 
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V  EQUIPMENT  Th&LE  ENTRIES 

* 

1 2 , DVR 32 , D 

EOT  #1 

1  3  .  DVR  05  ,  B,  y.~  1  3  ,T  -  1  S  0  0  0 

*  EOT  112 

10..  DVR 3 6  aai^ 

*  EQT  #3 

2  0 , D  VR  0  0 , & , T~ 1 3 0  0  0 

♦EOT  D4 

1  7 ..  DVR  01  b ..  T— i:<  0  0 

■KtO  T5 

4 ,  DVP43 

*  EOT  #6 

,'E 

* 

■* 

•*  DEVICE  REFERENCE  TABLE 

2 . 0 

*  LUt  SYS .  CONSOLE 

1  ,  1 

LUjf  SY'j  .  OlbK1,  LO’JbK  PLOTTER  > 

0,  0 

•«'  LU3  RlJ.V.  DISC 

2,  1 

*  LU4  L.  CTU  DRIVE 

f  LU5  ft.  CfU  DRIVE 

0 ,  0 

LUo 

0,  0 

*  LU? 

0 ,  0 

*  Lua 

0 ,  0 

:*.•  LU9 

4, 0 

•* 

LU1  0 

n  v 

5/  0 

*  * 

LU  1  1 

PrPCR  TrPE  READER 

0,  0 

■¥ 

LU  1  2 

0 ,  0 

* 

LUt  3 

0 ,  0 

■* 

LU  1  4 

3,2 

* 

LUt  5 

MO?  MODULE  1 2 ••>?&■« 

0 ,  0 

Lut  a 

0 ,  0 

:+ 

L  U 1  7 

0,  0 

+ 

l  u  t  a 

0,  0 

LU  1  9 

1  ,  0 

•* 

LU2  0 

DISK  SUBCHANNEL  0 

1,2. 

* 

LU2  1 

DISC  SUBCHANNEL  2 

0 ,  0 

* 

LU22 

0  .  0 

* 

LU23 

0,  0 

:¥ 

LU24 

C>  /  V 

f- 

* 

LU25 

POUER  Em  I L 

*  INTERRUPT  TABLE 

•f 

4 , ENT , $PGWR 

12,  EOT, 1 

1 3,  £01 , 2 

i 2 ,EOT ,5 

u  .  IV  U  ,  PPmk  i 

90 

i 

i 

i 

i 

1 

T  lj'> 


U 

c* 

vt‘-> 

YES 

0 

1 ,4, EG 

,  i  •  rilj 

5 ,  to..  SC 

,  t  ->  KG 

,  i  \-i,  oi j 

6 ,  19,  E:G 
7  ,  1  9 , 9  G 
/E 

r  i  i  L  K  ,  1 

ED  I Fk,  14 
LUOL'K  ,  1  *» 

r)SM9  ,  1  t* 

vNREF  •  t  5 
■  f 


The  following  generation  listing  was  produced  by  the  On-Line  Generator 
(RT3GN)  for  the  RTE-III  system  generation  done  for  the  microprogramming 
configuration  in  this  study. 


f 


ECHO? 

TR ,  A'?236  / 

ECHO? 

YES 

EST.  41  TRACKS  IN  OUTPUT  ^ILE? 

4  0 

OUTPUT  FILE  NAME? 

SSi'Sc*  i  ,  /  I  0, 

TAPGFT  DISK? 

79  06 

CONTROLLER  CHNL? 

12 

#  TRKS,  FIRST  CYL  U ,  HEAD  #,’  # 
00? 

3  0  0 ,  0 , 0 , 2,  0 ,  22 
0 1  > 

256 , 0,2,2, 0,6 
02? 

536 ,  1 32 ,2,2,  0 , 2^i 
03? 

,'E 

#  123  WORD  SECT ORS/TRACK ? 

43  .. 

SYSTEM  SUBCHNL' 

1 

uu:<:  Disc  <  YES  OR  Nu  OR  It  TRKS> 
NO 

TOG  CHNL? 

I  » 


>£CHO 

K  ti TRACKS  IN  OUTPUT  FILE 

*  OUTPUT  SYSTEM  FILE  NwME 

DISC  MODEL 

*•  DISC  SELECT  CODE 
SURFACES,  UNIT,  41  SPARES 

■*  subchannel  0 

*  SUBCHANNEL  t 
SUBCHANNEL  2 

«<  TERMINATE  SUBCHANNEL  DEF 

41123  WORD  SEC  T GORbV'  TF.K 

*  SYSTEM  SUCCHhHNEL 

y 

*  rtUX  disc 

*  TNG 


i 

I 
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l 


Fk  I V  .  IHT.  CARD  hDDR •* 

li 

FF IV.  DRIVER.}  riLlfob  X.  WIML'M  . 

HO 

EG  CORE  LOCK? 

Vt 

EG  CORF  LOCK? 

VE 

SuinP  DEL  mV? 

5  0 

rit'ri  SIZE? 
t  1  2 

BO  OT  FILE  HO  ME  ? 


*  PR i V  TmTEPFOPT 

*  PR IV  f>  I  EVERS  ACCESS 

¥  KG  CORE  LOCK 

«■  K  UKIi  LUl.  < » 

*  SOAP  DELAY 

f  HEM  SIZE 


0 

PROG  INPUT  PHASE : 

* 

MAP  MODULES 

LINKS  IN  L OFF tUT 

*  RTE-III  SYSTEM  MODULES 


r 


kEL  . 

*;t.  R  3oV , 

,  1 

0  , 

EEL , 

S  'f  L.  I  &  / 

,  1 

0 , 

RET-  / 

YC  Lib'.  , 

1  u 

' 

PEl  . 

YPl i B I . 

.  i 

:T . 

REL , 

YRL I &2 , 

,  1 

0, 

PEL, 

,  YGML I B , 

,  1 

0, 

REL , 

,  YTCMD3, 

,  1 

0, 

RE L  , 

,  YUrjPG 1  , 

.  ,  1 

0, 

■*  BOOT  FILE 

i*  F  EL 0 AC  TE  MODULES  BY 
*.  L-.'  >  I  i*  r ■ :  —  I  Hi'.  I  I'l 

¥  DEMUR V  RES  iDEfiT  SYSTEM 
*  Fit  SYSTEM  s.IOkwRi 
•*  R  f C  COMP  I L  £P  i_  I  BEaEV 

i  pK  .  r.-os  i  R  !•  aR  (  par  t 
k  k  re/oos  library  part  e 
*>  rtf  Batch  library 

¥  RTE-III  COMMAND  PROGRR 

¥  R i £  batch  duhi  tor  .froc 


COD  I'l 


name 


pi 

kmm  part  i 
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REL, XBNPG3, ,10, 

* 

REL, MOECAR  .,10, 

♦ 

REL, XLDR3, ,10, 

* 

REL, TiMTM,  ,10, 

* 

RFL ,X*PVMP,  ,  10, 

* 

REL , 7CFF4  .  W,  ,10, 

I* 

rel,::dvroo,  ,  i  o. 

* 

REL,XDVR36, ,10, 

'¥ 

REL , 7I3DP43, ,10, 

* 

REL,?MDV05,  ,10, 

* 

REL,r,’DVR3C',  ,10, 

* 

REL,  JIAUTOR,  ,10, 

REL, MWH2T3, ,10, 

* 

rel,,*:editr,  ,10, 

:♦ 

REL ,  JiXREF ,  ,10, 

•  * 

rel.,;:asmr,  ,  i  o. 

PEL ,  TiSWTCH  ,,10, 

*♦ 

REL ,  XLU,  ,10, 

Rt-T  ,’iTT,  ,  1  0, 

REL ,  TiPPONG ,  ,10, 

REL , T.’MICRO ,  ,10, 

+ 

REL ,  JiMMREF ,  ,10, 

'■¥ 

REL, UNLOAD,  ,10, 

RTE  BATCH  MuH  I  Tuft  PROGRAM  PaPT 

RTE  DECIMAL  STRING  ARITHMETIC 

RTE  III  LOADR 

RTE  MULT I -TERM INAL  MOM  I TOP 

JrPVMP 

FORTRAN  TV  FORMATER 

RTE  TT Y/PUNCH/PHOT 0  READER  DVR 

RTE  UICS  DRIVER 

POWER  FAILURE  DRIVER 

264SA  DRIVER 

7906  DISK  DRIVER 

AUTO  RESTART 

WH2AT? 

EDITOR 

CROSS-REFERENCE 

ASSEMBLER 


SWITCH  OPERATING  SYSTEMS 


RTE  MICROASSEMBLER 

RTE  MICRO  CROSS  ASSEMBLER 


RTE  MICRO  LOAD  UTILITY  POUT I WE 


MO  UNOEF 


PARurlfcTEftS 

D .  RTR ,  1  ,  1  *  PAftArlEThftS 

f*CMC»,  ,>,  1 

WH2AT 

PRONG ,  I  ,  'S&7 67 
nSriB  ,  3 
PftFF ,  3 
LOADft, 3 
F D I TR , 3 
nOTuft, 3, 1 
ft  ftp? FT  ,  3 
ft  t  Fr  i  r ,  3 , 5  0 

,  ’E  ■«  i'ERMINPTE 

CHANGE  ENTS? 

*FAIJ  MACRO'S 
.  i'iM  i  ,  ftp  ,  t  nOR'OO 

.  C>  I V ,  ft P ,  t  0  04  0  0 

.  OLD . ftp ,  1 04500 

,  PST ,  ftp' ,  t  044  0  0 

¥ 

*  FLOAT  MAC-fcOS'S 
.  f  a  r> .  rp  .  t  or;ooo 


.  £ .  ftp  ,  1  1C..".-.  0 

•  -  .  -  !  •  ■  II' 

I 


.  I-  0 V  .  ftp  ,  1  ii  Ok  0 
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IF f X , Kp .  I  OF  1 00 
FLOAT ,  hP  ,  1  uf<  1  2  0 
,  MVW,  PP  ,  t  0577? 


2  tl>et. ,  PP  , 


.  £ 


»  OF  BLANK  ii>  JfcUFJcH'f  S  ? 


«•  ;t  BLANK  10  SFOIFN  (”*£• 


t»  OF  BLANK'  BO  BEG .  lit  SEGrfENTS ' 


15 

*  SHORT 

K>  StGMhNTS 

riAft  NUrtBE*  OF  POP; 
y 

TIT  I  ON": 

■*  iU iX  .  4 

P;>i"'  *  i  P  1 0??3 

FWm  BP 
I  (t  0 

LINKAGE*' 

*  FMh  BP 

L  I f>KAi»L 

SYSTEM 

*  OPSY< 

OOBB  > Of 000 

ii  i  7  ,"? 

5  2  u 6  0  - )  2  0  0  7-  P.  2  V 

.  1740  7 7  0S) 

4 

D  I  S'PtK 

CtyOy  >0:4  03  t 

047.7* 

9  2  0  6  6 -lo013  p‘  £  V 

f 

. 1726  7 7052 

7 

RTIHE< 

665’?  >65  O  S  i 

656  64 

•? .-j  OF.  0-  ?  c-  i.i  ?  4  F F  V 

.  i  7 i 0  7  7  0  i  3 

1 

Th  SCrK 

0  uyy .)  0“6  t.c* 

65677 

62  OF  0- 1  6  u  I  5  i'EV 

. (63?  76 062 

_/ 

R  T  i  uC< 

665'?  >6576,' 

i  24  )  3 

S3  OF  0- 1 6  0 ) 6  REV 

.  i 74  0  >7 092 

i 

EMECCX 

0655  >  1  244S 

)  434  1 

■  •  .  MS  6-  1  <-  0  1  6  REV 

174  0  77 OB ? 

4 

» fFfrK 

0 055 > 1 4 55  f 

’45?  4 

i"i  ?  s'.  0  1  RE''' 

.  75  0726 

- 

SCHEIX 

0  OS':/  >  1  4540 

705^  0 

-  4  7 1 '  0  0  -  t  i  i  2  6  P  E  V 

.  17  10  77 OS  t 

4 

?  olo  >■ 

069  ?  >2  6555 

:  ?  »  i  i'i.i 

.» '  >.i>  ;i-  ;  c-  0 1  7  0  FV 

r)  75  05  ''5 

F'iCLO'. 

6699  >2  1  0  u'3 

2  t  0  02 

9  ;*;  0  0 1  -  ?  6  0  05  ••  £  V 

1 74  0  77  O' l 

(Cl  I B-; 

0  699  >2  1  0  03 

2  1  6  62 

02  oc'.  i_\  -  )  7  i  i  OS  P£V 

. 1 726  77052 

3  tC-U  i  6 

RL  I B 1  •: 

6  099  .>2  )  Ci  03 

3  1  0  if  2 

14003-1  6Cu<1  R 

EV.I740  77 

06 1  2 

f*:L 

0  09y  >2  J  0  u  5 

J  1  0 1 0 

34  .:■?  >  -  i  6  i.i  0  ? 

L7  .  1  .  4  0 

uG  1  2 
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* 


t 


TBUOf s‘.  0 0^9  >2i  O'* 3  21  0  02 

92  0  0  t  >  0  0  i  = 

•  1  O  ;■  1  or  *  0 

FF4  .  ri>;  0099  >21  0  03  21  0  02 

2*499:3- 1  o  0  02 

REV.1 7-0  770525 

f> Vk 0  00  >j 07  >y  1  0  1  7  7  i , '  - 

' '  -  '  ~  L‘  * ;  1  r  r  ■/ 

1  .  -i  v  7  0  0  2 

OVPSo-.  Oo99  '2221  0  2-u  ■  f 

i  L  l  '  |  7  |  7 

-  1  r-  It  Is  1  Rj.  v  .  *  1 

OVP't  30  0029  >2"!  27?  231  i  0 

3i  Li E  >0-  1  9  00  (  PFv 

•  1  033  7 F.  nO,  0 

O'  V  R  0  5  <  0  0  9  9  >  25174  3  0  0  f  <7 

92  u  0  i  —  i  c.  02  7 

RFv  .  ,  S  i.i 5  i  ii  0-  7 

DVR32C  0  099  >3  0020  3 » 545 

9 2  0 0  0  -  (  0  0  s  t  k  E  V 

4  75  )  02' 4 

*•«  uF  I /Ci  C LOSSES’ 

4  0; 

>  tf  OF 

i O'  classes 

*■!»  OF  LU  HAPPI/UVS"* 

0 

«  4  t  U  N 

;FP  IDO 

*  tt  OF  RESOURCE-  MUrlorRS*' 

■«  Ff i 

BUFFER  LIMITS  <  LOU ,  HIGI- 

1  ..•? 

1 29 ,512 

i-  BUFFER 

L  I  s'i  i  T *  L  uU ,  / 1 1  Ci  /  ’> 

*  FOUIFtiEMT  TUBlE  EUfRV 

EOT  0  1  ? 

• 

*  F Oi t  IRMC-U T  T  EEL fc  Fi  TI F:  I  i: 

1  Z’ ,  1  'VEST- ,  u 

*  EOT  41 

EOT  02? 

1  :  1>VR  05  .  t: ,  X~  1  3  .  T  •  1  8  o  O  O 

*  FO'f;?2 

L'OT  03? 

1  *’  L'VIr  So 

*  t  •  r  . 

7 

£0  i  04? 

i-1  i  'Vk  ii  0 0 ,  r  -  1 S  u  li  0 

kFOT  ;m 

CO  T  Or.  '■ 

1  7  ,  DVR 01  ,  t-: ,  i  500 

*•£0/7 

h o  r  Ob'- 


F1"'  I  0  7 
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t 

( 


£'  --  EOT  ii  ~ 
1  1 

v  =.  F.r-T  I  i7 

(i  ,  0 

4  -  Ft',  r  it  ' 
_• .  1 

••  ■  EOT  if-' 


F  -  fcul  4*7' 
0 ,  o 

7  -  F  0 1  »? 

r.  .  t": 

S  =••  FuT  n 
u  .  u 

■7'  i- >7  T  „  - 


i  ft  •-  for  tt  •? 
4  ,  n 

ti  -  Far  »  - 


1  2  -  [7  0  T  if- 

i  t .  i.' 

I  .*  *  EOT  B  :■ 

II  .  1.1 

i  t  Far  tr- 


t  i_'.u  iVc .  cc.*i-ius.e 

>  LU2  SYS  .  6I£t  ■■  i.Q’JEti:  FL’-i  r  TEE:  • 

*  t-U7  au;-  .  i.< I-;.*- 

4  t.  i  i  )  t-  .  l'  |'l  I  !'!■•■  I 
~-.j¥ 

*  LU5  R .  CTU  uRJVF 

4  L.U6 

4  Lit  7' 

t  i..  U5t 

*  i.i.iy 


4  L. ‘  5  1  1 

4  t.llt,' 

4  LillF 


1 


k 


I 


!  I-  *  e  a  r  *t  ? 

5 , 2 


■*  LUIS  UCS  MODULE  1 


to  =-  FOT  If' 


*  i_i  ‘ !  to 


I  --  £  0  1  «  ? 

0 ,  0 


«  LU  I  7 


15  -  LOT  #? 


l  9  -  LOT  tt? 


*  i_UIS 


*  LUIO 


20  =  EOT  tt  ‘ 
1  ,  0 


■*  LUZO  DISK  SUBOMriNEL  0 


21  =  b.W  I  *? 

1 , 2 


•*  LU21  DISC  SUBCHhnDLL  2 


22  *  COT  #? 


2  2  =  EOT 


2 ‘I  =  EOT 


2s  -  cu  r  »t  i' 


2o  =  EOT  ft? 


*  j  r  i  i  E  K'-’iJO  i  1  *'”■< 


t  i  ti  l  fc  f-'HUT'  I  7  »'iEl  i- 


-i ,  tim ,  rPOWK 


!  .  Ef  r , 


*  U  lit.- 


*  i.K-  < 


*  LU2'4 


.*  i. n 2 5  E.iIl 


i  V  :  i  I 


I 


PF'CnG*  276,  >44347  44-73 

I 

! 

(  RT  DISC  RESIDENTS 


■•;  NONE  > 


GG  DISC  RESIDENTS 


f$CrttX  000 1 

>4  0  0  t>2 

4  *  .*>  -1 

*'2060-  160?s  REV  17’ 0 

pmerR 

4  1  3  53 

4  1  372 

7  7  1 t  >■*  1  c  3  4  *  * —  1  c.  -  1  *  0  1 

FI'IGR  0  09  Ci 

>4  o  o  OS' 

4  0757 

6  ~  ij  0  2—1  3  0  t1  "■  E  E  V  1631 

FM .  CM 

4  0  76  0 

4  -'o  7  1 

82  0  02 -  1  6  0  OS  7 6  06  l  6 

.  DRCT 

42?  rs 

4  ?U  04 

•*»2 1>  0  1  -  1  6  0  06  74  1  1  2  0 

IFbRk 

4  30  to 

4  3  t‘7c 

02  o  0  1  -  i  •_-•  0  05  7-111  2 1' 

.  DFER 

4  30:7:7 

4  3  1  0 u 

:-0  7  0  1  24  5SS-  1  oOO  ; 

OPEN 

4 Si  t« i 

4  5jfcc 

6_  u  02  - 1  6  0  u 6  74  1  2 06 

RMPRR 

4  3  io  6  7 

4  3324 

7  ?  0  3  1  2  2 4  8  9  S  -  1  6  0  0  1 
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APPENDIX  F 


List  of  Commands  used  for  back  in;'  up  System  on  Disk. 

It  is  assumed  the  JDISKL'P  program  is  on  a  minicartridge.  The  follow¬ 
ing  is  a  list  of  steps  required  to  load  and  run  IDISKUP  for  the  micro¬ 
programming  system  configuration  developed  for  this  study. 

1.  To  load  ! DISKUP  from  the  minicartridge:  insert  the  cartridge  , 

into  the  graphic  terminal  and  set  the  computers  S-register  to  04302 

octal.  Press  Preset/IBL.  Press  RUM. 

Program  should  now  load  into  computer  and  then  HALT  with  102077 
octal  displayed  on  the  S-register  display. 

2.  After  the  program  is  loaded,  store  an  octal  13  into  the  S- 
register  and  set  the  P-register  to  an  Octal  2.  Press  PRESET/RL’M.  The 
program  will  now  execute  interactively  through  the  graphics  terminal. 

Questions  will  be  presented  on  the  graphics  terminal.  You  must  sup¬ 
ply  the  answers. 

3.  The  following  is  a  list  of  questions  and  the  answers  used  in 
this  study  to  backup  the  operating  RTE-I1I  system  and  other  system  files. 

It  requires  running  two  passes  of  IDISKUP.  The  first  pass  copies  the 
operating  system  from  cartidge  1  to  the  removable  disk  cartrige.  The 
second  pass  copies  the  system  files  on  cartridge  10  to  the  removable 
disk  cartridge. 
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QUESTION'S 

TASK? 

SOURCE  DISK  CHANNEL  •'-? 
SOURCE  DISK  TYPE? 

SOURCE  DISK  DRIVE? 

TYPE  OF  COPY? 

RTE  or  DOS  DISK? 

FROM  CYLINDER  #? 
if  of  TRACKS? 
if  of  SURFACES? 

STARTING  HEAD? 

DEST.  DISK  HEAD? 

TO  CYLINDER  if 1 
if  SURFACES? 

STARTING  HEAD  #? 

6144  WORD  BUFFER  DESIRED? 
VERIFY* 


ANSWERS:  1st  pass  2nd  pass 

CO  CO 

12  12 

7906  7906 

0  •  0 
FR  FR 

RT  RT 

0  0 

256  256 

2  2 

2  2 

0  0 

0  132 

2  2 

0  0 

YES  YES 

YES  YES 


*  At  this  point  hit  any  key  for  second  pass:  SDISKUP  will  respond 
with  the  second  set  of  questions. 
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APPENDIX  G 


Discussions  of  Implementation  Problems  Tr.countered  During  Study 

Many  minor  operating  problems  which  will  not  be  described  here 
were  encountered  during  the  implimentation  and  microprogram  develop¬ 
ment  phases  of  this  study.  In  the  most  part  these  operating  problems 
were  due  to  a  lack  of  adequate  and  readily  available  HP  documentation. 
Once  the  proper  HP  documentation  and  operating  procedures  were  located 
many  initial  operating  problems  were  resolved.  Technical  discus¬ 
sions  with  other  HP  users  on  base  also  helped  tremendously  in  getting 
started  and  in  identifying  appropriate  HP  documentation. 

In  addition  to  all  the  operating  problems  encountered  four  major 
implementation  problems  occured.  The  following  four  events  are  pre¬ 
sented  as  an  example  of  the  Implementations  problems  encountered.  The 
corrective  action  taken  is  listed  where  applicable. 

I 

First  the  base  instruction  set  module  proved  to  be  a  major  ini¬ 
tial  problem  with  booting  the  c.ompucer  up  with  the  RTE  operating  sys¬ 
tem.  The  computer  would  not  boot  up  properly.  Usually  several  attempts 
were  required  to  make  the  computer  boot-up  correctly.  After  boot-up 
the  system  acted  highly  unstable  generally  resulting  in  the  computer 
halting.  The  problem  was  traced  to  the  base  instruction  set  board. 

The  instruction  board  initially  installed  in  the  CDU  was  an  older 
version.  The  solution  was  to  replace  this  board  with  a  later  version 
board  available  in  a  second  CPU  unit  at  AFIT.  Using  the  new  base  in¬ 
struction  board  cleared  up  the  boot-up  problem. 

A  second  problem  occurred  shortly  after  replacing  Uv'  base  Ln- 
•itructi-'n  .  ;  the  main  r  ■  I  -m  .  :\-rt  un- 
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ately  the  solution  was  to  use  the  second  CPU  power  supply  unit  avail¬ 
able  at  AFIT  while  a  new  power  supply  was  ordered  froin  HP.  However, 
this  meant  disassembling  and  reassembling  the  entire  computer  to 
change  the  main  frame. 

The  third  problem  occurred  with  the  papertape  photo  reader.  Ini¬ 
tially  much  of  the  software  available  at  AFIT  was  still  on  papertapes. 
However  the  papertape  reader  was  difficult  to  adjust  properly.  It  would 
only  work  with  black  tapes.  Keeping  the  optics  clean  helped  eliminate 
some  problems  of  reading  the  tapes.  A  better  solution  was  initiated  by 
installing  a  leader  ROM  to  read  magnetic  tape  cartridges  from  the  sys¬ 
tems  HP  2648  graphics  terminal.  Transferring  needed  programs  to  magne¬ 
tic  tape  cartridges  solved  the  problem  of  reading  data  and  programs 
from  punched  tapes. 

The  fourth  problem  was  how  to  obtain  a  hardcopy  printout  of  a  pro¬ 
gram.  This  was  partly  solved  by  interfacing  a  TI  TTY  printer  to  the 
computer.  However  the  unit  would  only  operate  at  110  band.  Initial 
attempts  to  make  it  operate  at  300  band  failed.  It  was  later  learned 
the  crystal  on  the  interface  board  was  not  correct.  Obtaining  the  pro¬ 
per  crystal  solved  the  problem  and  the  TTY  printer  then  worked  with 
either  110  or  300  band. 


APPENDIX  H 


How  to  Run  ACTV  Program  cn  RTE-IVB  System 

User  instructions: 

The  FORTRAN  Program  ACTV  must  be  compiled  and  loaded  before  it 
can  be  run.  Also  the  assembly  subroutine  RCORE  must  be  assembled  and 
loaded  with  ACTV.  Compiling  or  assembling  a  program  create  binary 
files  of  the  programs.  It  is  assumed  the  source  for  ACTV  is  on  file 
&ACTV  and  the  source  for  RCORE  is  on  file  &RCORE.  If  ACTV  must  be 
compiled  this  can  be  cone  by  typing: 

j_RU,  FTN4,  5. ACTV,,  %ACTV 

If  RCORE  must  be  assembled  this  can  be  done  by  typing: 
jRU,  ASMB,  &RCORE, ,  T'RCORE 

Once  ACTV  is  compiled  and  RCORE  is  assembled  the  programs  must  be 
loaded.  This  is  done  by  typing:  • 

jJRU,  LOADR 

/LOADR:  RE,  %ACTV 

RE,  %RCORE 
/E 

After  loading  the  programs  you  must  set  the  program  priority  to 
be  less  than  the  file  manager  system  program  (FMCR) .  Usually  the  prior¬ 
ity  of  FMCR  is  set  at  90.  You  must  set  ACi'V  to  a  priority  of  89  or 
less.  The  priority  of  ACTV  is  set  by  typing  on  the  system  console: 
jPR,  ACTV,  89 

Before  you  can  run  the  ACTV  program  you  must  also  load  the  program 
you  wish  to  monitor.  Then  assuming  the  program  you  are  monitoring  has 
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been  loaded  ACTV  can  be  run  by  typing: 

:RU,  ACTV 

Once  running,  ACTV  will  ask: 

ACTIVITY  PROFILE  GENERATOR 
TYPE  PROGRAM  NAME : 

You  must  type  in  the  name  of  the  program  you  are  monitoring  at 
this  point.  Then  the  program  will  ask: 

TYPE  BOUNDS  OF  ACTIVITY  PROFILE,  LOWER-UPPER 
XXXXX  XXXXX  X 

You  must  now  type  in  the  address  range  in  the  monitored  program  you 
wish  to  examine.  The  address  range  is  obtained  from  the  load  map  of  the 
monitored  program.  The  load  map  is  generated  when  the  monitored  program 
is  loaded. 

After  entering  the  address  ranges,  hit  any  key.  This  will  cause  the 
computer  to  issue  the  command  prompt.  At  this  point  type  in: 

jRU,  (name  of  program  being  monitored) 

The  monitored  program  will  now  execute  in  its  normal  way.  If  the 
program  is  interactive  you  supply  the  data  requested.  When  the  program 
terminates,  hit  any  key.  The  command  mode  will  appear,  then  type  the 
following: 

J_BR  (Break) 

The  ACTV  program  will  terminate  and  printout  a  histogram  and  cumula¬ 
tive  plot  of  the  activity  monitored  the  address  ranges  you  specified. 

To  zoom  in  on  a  program  you  must  rerun  ACTV  with  the  address  ranges 
narrowed  in  on  the  locations  desired. 
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APPENDIX  I 


Sample  Profile  Analysis 

This  Appendix  contains  the  activity  profile  analysis  performed  for 
the  F0UR1  and  FOURE  FFT  programs.  The  address  load  map  for  the  two  FFT 
programs  is  given  in  Table  VI.  The  results  of  the  activity  profile  gen¬ 
erator  program  ACTV  for  the  F0UR1  program  is  listed  in  Table  VII.  Figure 
12  shows  the  F0UR1  results  plotted.  The  zeros  represent  a  histogram 
of  the  activity  and  the  I  symbols  plot  the  cummulative  distribution. 
Likewise  Table  VIII  contains  the  activity  results  obtained  for  the  FOURE 
program.  Figure  13  is  a  plot  of  the  FOURE  activity  results  contained  in 
Table  VIII. 

F0UR1  PROFILE  ANALYSIS 

Bit  Reversal:  The  bit  reversal  software  is  computed  between  the  ad- 

» 

dress  range  56363-56515  (Interval  if 3-12).  The  activity  profile  in  Table 
VII  shows  38  bits  occured  in  this  address  range.  This  amounts  to  approx¬ 
imately  38/1044  or  3.64%  activity  for  the  bit  reversal  software. 

Sine :  The  Sine  values  are  computed  in  the  address  range  57023  - 

57122  (Interval  35-41).  The  profile  in  Table  VII  shows  216  bits  occured 
in  this  range  out  of  a  total  of  1044  bits.  Thus  the  sine  computation 
takes  approximately  216/1044  or  20.68%  of  the  total  activity  shown  in 
the  profile. 

Butterfly:  The  address  range  56636  -  57023  (Interval  23-34)  com¬ 

putes  the  butterfly  values.  As  shown  in  the  profile  of  Table  VII  764  bits 

occured  thus  the  butterfly  for  the  FOUR],  program  takes  roughly  764/1044  or 
73.18%  of  Liu.  total  activitv. 
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Overhead :  All  other  activity  amounted  to  24  bits.  This  is  24/1044 

or  2.29 %  of  the  total  activity. 

FOCRE  PROFILE  ANALYSIS 

Bit  Reversal:  The  address  range  55534-55656  (Interval  8-20)  com¬ 

putes  the  bit  reversal.  The  activity  profile  in  Table  VIII  shows.  102  hits. 
Hence  the  bit  reversal  activity  is  approximately  103/584  or  17.6%. 

S ine :  The  address  range  55706-55733  (Interval  24-26)  computes 

the  Sine  values.  The  activity  profile  in  Table  VIII  shows  79  hits  occured 
in  this  range  out  of  a  total  584  hits.  Thus  the  sine  computation  takes 
approximately  13.5%  of  the  activity. 

Butterfly:  The  activity  in  this  section  is  roughly  divided  into 
two  sections.  First,  the  CONTINUE  statements  110  and  120  in  the  DO  loop 
for  the  butterfly  computation  show  the  following  hits: 

110  CONTINUE  56023-56032  95  hits 

120  CONTINUE  56032-56041  13  hits 

TOTAL  108  hits 

Hence  the  CONTINUE  statements  take  10S-5S4  or  18.495  of  the  total 
activity. 

Second,  the  address  range  55742  to  55623  computes  the  following  part 
of  the  butterfly  code. 

TEMP=W*DATA 
D  ATA=*  D  AT  A- T  EMP 
DATA-DATA+TEMP 

The  activity  profile  in  Table  VIII  shows  260  hits  occured  in  this  ad¬ 
dress  range.  Hence  260/584  or  44.86%  activity  in  this  area. 

The  total  butterfly  therefore  shows  approximately  63.35%  activity. 
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Overhead; 
some  activity. 
33  hits.  Hence 


The  overhead  Is  all  the  other  address  ren8es  which  showed 
5UBpl„g  the  activity  of  .U  the  other  sections  resulted  In 
the  overhead  amounted  to  about  33/531  or  5.65%. 


Table  VI 


Address  Load  Map  for  F0UR1  &  FOURE  FFT  Programs 


FFTC2 

34042 

55453 

FOURE 

SS4S4 

56203 

RTIME 

56204 

56351 

FOlJRi 

56352 

57201 

PAUSE 

57202 

57301 

24998-1X253 

REV. 2101 

801007 

RMPAR 

57302 

57346 

92068-1X025 

REV. 2101 

800919 

.DIO. 

57347 

57430 

24998-1X331 

REV. 2101 

800929 

.  EIO. 

57431 

60655 

24998  1X329 

REV. 21 01 

801107 

.  FMCV 

69656 

63120 

24998-1X333 

REV. 2101 

800709 

FMTIO 

63121 

64352 

24998-1X328 

REV . 2101 

800929 

.  I  DER 

64353 

64466 

24998-1X321 

REV. 2101 

800731 

.  UFMP 

64467 

64501 

24998-1X296 

REV  .2.101 

800731 

RE  10 

64502 

64626 

92067-1X275 

REV. 2013 

790316 

PAU.E 

64627 

64627 

24998-1X254 

REV. 2001 

750701 

PNAME 

64630 

6470  0 

92068-1X035 

REV. 2101 

800919 

.  CTOI 

64701 

65040 

24998-1X035 

REV. 2013 

791022 

.  ENTC 

65041 

65073 

24998-1X155 

REV. 2001 

750701 

.  CFER 

65074 

65135 

24998-1X196 

REV. 2001 

790523 
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Activity  Profile  results  for  FOURE  FFT  Program 
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Figure  13.  FOURE  Activity  Profile  Plot 
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APPENDIX  J 


Appendix  J  contains  a  listing  of  the  microprogrammed  bit  reversal 
sorting  routine  developed  during  this  study.  The  bit  reversal  routine 
was  named  BINV.  Also  included  in  this  Appendix  is  the  Assembly  inter¬ 
face  routine  that  links  the  microprogram  with  the  FORTRAN  FFT  calling 
program.  A  flow  diagram  showing  the  algorithm  for  the  BINV  program  is 
also  included  in  this  Appendix  in  Figure  14. 
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1 


MICMX , L ,C 

ORG  600SB 

1iC  ODE= '  MSB  I N  :  :  It)  >  R  EP  LACE 

* 

*  THIS  MICROPROGRAM  PERFORMS  THE  BIT  REVERSAL 

*  AND  EXCHANGES  COMPLEX  FLOATING  POINT 

*  DATA  FOR  COMPLEX  RADIX  2  FFT  PROGRAMS 

* 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

He 

* 

* 

* 

# 

* 

* 

♦ 

# 

* 

* 

* 

* 

* 

* 

* 


THE  CALLING  SEQUENCE  FROM  FORTRAN  IS: 

CALL  BINV <  N  >  NN  >  DATA ) 

WHERE  N  -  NUMBER  OF  COMPLEX  DATA 

NN -  POWER  OF  2  FOR  FFT<N=2**NN) 

DATA —  COMPLEX  DATA  ARRAY  (OR  DATA 
ARRAY  WITH  ALTERNATING  REAL 
AND  IMAGINARY  PARTS) 


PARAMETERS  N ,  NN ,  AND  DATA  ARE  PASSED 
TO  THE  MICROPROGRAM  THROUGH  AN  ASSEMBLY 


INTERFACE  PROGRAM. 

CALLING  SEQUENCE  I 

JSB  . ENTR 
DEF  N 
DEF  NN 
LDB  N , I 
L.I)A  NN ,  I 
I..DX  DATA 

OCT  10560S 


B-REG  CONTAINS  N  VALUE 
A--REG  CONTAINS  NN  VALUE 
X-REG  CONTAINS  STARTING 
ADDRESS  OF  DATA  ARRAY 
INVOKES  MICROPROGRAM 
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I 


IS 


THE  ALGORITHM 
THE  BIT  REVERSAL 


ING  POINT 
TRANSFORM 


M I CR OP R OGR AM  IMPLI MENT S 
PAR  ISON,  AND  COMPLEX  FLOAT- 
DATA  EXCHANGE  FOR  A  RADIX  2  FAST  FOURIER 
ALGORITHM  IS  AS  FOLLOWS.- 


PROGRAM. 


THE 


STEP  1 


FORM  THE 
REGISTER 
REVERSED 


BIT 

S2  . 


REVERSED  INDEX  NUMBER  IN 
THE  NUMBER  TO  BE  BIT  RE¬ 
IN  REGISTER  54. 


STEP 


^  iff  ^  ^  ^  JK  ^  ^  ^  ^  ^  *|(  i|(  ^  )K  ^  ^  )|(  )|(  *  )j(  ij(  *  if.  ^  )|i )(( )|(  )|(  ij(  )^(  )f[ 

4  'X*  ^  X  X  ^  W  -X  J,  ^  ^  ^  ^  ■  L'  ^  <As  yb  Aj  ^  -Aj  ^  df  ^  ^  '[/  ^  ^  ^  .  L.  Aj  .1,.  .d  d'  ^  ^  ^  ^  ^  ^  Jj  ^  ^  >b  ^  X  ^  -U  d.  ^ 

^  ®  ®  ^  ^  ^  ®  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  »p  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  /p  ^  ^ 

* 

*  ALGORITHM;  BIT  REVERSAL  SORTING 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

*  ' 

* 

)|(  ]|(  )|(  )|(  ^  ^  ^  ^  ^  ^  j|^  ^  ^  ^  ^  ^  ^  y^  y|^  ^  yj^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  y^  y^  ^  ^  ^  ^  ^  y^  ^  ^ 


COMPARE  THE  BINVERTED  INDEX 
WITH  DATA  INDEX  Y.  IF  S2  IS  GREATER 
THEN  Y  THEN  EXCHANGE  THE  COMPLEX  DATA 
"IN-PLACE".  FIRST  FORM  THE  ADDRESSES 
FOR  THE  DATA  EXCHANGE  AND  THEN  JUMP 
TO  THE  SWITCH  ROUTINE  LABLE  -  SW . 

THE  DATA  ADDRESS  FOR  THE  INDEX  Y  IS 
STORED  IN  THE  A  REGISTER .  THE  DATA 
ADDRESS  FOR  THE  BIT  REVERSED  NUMBER 
IS  STORED  IN  THE  B-REGISTER. 


STEP  3, 


STEP  4 


STEP  5 


IF 

DO 


S2  IS  LESS  THAN  OR  EQUAL  TO  Y  THEN 
THE  NEXT  INDEX  NUMBER  Y 


CHECK  TO  SEE.  IF  ALL  DATA  POINTS  HAVE 
BEEN  DONE.  CONTINUE  STEPS  i-3  UNTIL 
ALL  DATA  HAS  BEEN  PROCESSED. 


AFTER  ALL  DATA  HAS 
THEN  RESTORE  VALUE 
THE  M-REGTSTER  AND 
PROGRAM. 


BEEN  EXCHANGED 
OF  P -REGISTER  AND 
RETURN  TO  THE  CALLING 
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ft*************************************************************** 

* 

*  BEGIN  BIT  REVERSAL 

* 

* 


MBINV 

Si  i 

P 

SAVE  THE  RETURN  ADDRESS 

DEC 

Si  2 

B 

SAVE  N-i  IN  S1.2 

SiO 

A 

SAVE  NN  IN  SiO 

ZERO 

Y 

INITIALIZE  INDEX  COUNTER 

NEXTY 

S4 

Y 

PUT  INDEX  COUNT  IN  S4 

ZERO 

S2 

INITIALIZE  BINVERT  REGISTER 

ss 

SiO 

PUT  NN  COUNT  IN  SS 

LOOP 

Li 

S2 

S2 

LEFT  SHIFT  S2  i  BIT 

84 

S4 

CHECK  INDEX 

.IMP 

CNDX 

ALO 

RJS 

*+2 

ALO  BIT— JMP  NEXT  INST 

* 

IF  ZERO 

INC 

S2 

S2 

ADJUST  BINVERT  COUNT 

Ri 

S4 

S4 

RIGHT  SHIFT  INDEX 

DEC 

SS 

SS 

ADJUST  BIT  COUNT  NN=NN-i 

JMP 

CNDX 

TBZ 

*+2 

IS  COUNT  ZERO?  YES, JMP  INST 

JMP 

LOOP 

NO! ! !  NOT  DONE  YET! ! ! 

* 

* 

^  Ui  |A‘  ^  ^  ^  ^  ^  X  ^  ^  ^  ^  U/  vjL'  t  h  <  It  ^  ^  ^  ^  ^  dt  ^  iL  ^  ^  ^  ^  J,  d.  tjl  tl.  d.  ^  ^  (L  ^  ^  ^  ^  ^  dh  yL  ■!■  «|j  J*  dt  ^  ^  yU  d.  dl  |L  ^  dj  dd  yd-  b  d(  ^  tit  tit 

*  ^  *•  ^  »p  *  ^  ^  ^  ^  ^  ^  ®  ^p  ^  ^  ^  ^  ^  ^p  ®  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  rp  ^  ^  ^  ^  ^ 

* 

*  NOW  COMPARE  BINVERTED  INDEX  WITH  DATA  INDEX 

* 

* 

I...  S2  PUT  BINV  *  INDEX  IN  I..-REG 

SUB  Y  SUBTRACT  INDEX*  FROM  BINV  * 


JMP  CNDX 

ALiS 

RJS 

CONTINUE 

IF (BINV*> INDEX) 

* 

* 

THEN  EXCHANGE  DATA 

4* 

tb  yj^  dt  dl  \b  X  t ^1  ^  ^  ^  t Jtf  ^  ^  ^  d>  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  dt  tb  dt  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d 

•  p  ®  ^  ®p  *  *  ^  ^  ^  ^  ^  ^  ^p  ^p  ^p  ^p  ^p  ^p  ^p  ^p  ^p  ^  ^  ^p  ^p  ^p  ^p  ^p  ^p  ^p  ^p  ^  ^  ^  ^p  ^  ^  ^  ^p  ^  ^p  ^  ^p  ^p  ^  /p  ip  ^  ^  ^p  ^p  ^  ^p  ^p  ^  ^p  ^  ^  ^  ^  ^ 

* 

n* 

*  NOW 

* 

FORM  THE  ADDRESSES 

5  FOR 

THE  DATA 

EXCHANGE 

* 

SS 

Y 

Li 

SS 

85 

FORM  Y  INDEX*4 

L.  i 

SS 

SS 

Li 

S2 

82 

FORM  BINV*  INDEX #4 

* 

Li 

S2 

S2 

* 

1. 

X 

PUT  DATA  ADDRESS  IN  L  REG 

ADD 

A 

SS 

A-ADDRESS  DATA( INDEX) 

ADD 

B 

S2 

B~ ADDRESS  DATA (BINV*) 

* 

1^  ^  ^  dl  iL  ^  dl  ^  ^  X  ^  X  ^  ^  ^  y^  I  Lt  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  dt  ^  dt  tit  d.  ^  d.  ^  ^  tb  ^  dt  tb  dt  d.  t  bt  dt  dt  yU  d.  ^  yb  ^  ^  ^  tdt  ^  ^  ^  tb  ^  yl  '  d  .  I.  dt  ^ 

^  T*  *  ^  ^  ^  ^  T  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  V  ^  I*  '  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^ 

* 
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♦NOW  JUMP  TO  SWITCH  ROUTINE 

* 

* 


Us 

JMP 

SW 

SWITCH  COMPLEX  DATA 

* 

^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  yj^  ^  ^  ^  ^  ^  ^  ^  yj^  y^|  y^  y^/  ^  y^  ^  ^  ^  ^  ^  y^  ^  y^  ^  y^  ^  y^f  ^  ^  ^  ^  ^  y^  ^  y^  y^j  ^  ^  ^  ^  y^ 

* 

O' 

CONTINUE 

1 

Si,? 

NOW  CHECK  TO  SEE 

SUB 

Y 

IF  ALL  DATA  POINTS 

JMP 

CNDX  TBZ 

.  DONE 

ARE  DONE 

INC  Y 

Y 

NO! M BUMP  INDEX  Y  COUNT 

* 

* 

JMP 

NEXTY 

AND  DO  NEXT  INDEX 

* 

* 


.  DONE 

P 

SI  i 

RESTORE.  P 

READ  RTN  DEC  M 

Si  1 

AND  M  REGISTER  AND 

* 

RETURN  TO  MAIN  PROGRAM 

* 

* 

]|(  )$C  ijl  ff.  3jc  5^C  )|(  )|(  )(c  5^C  *  ]|[  ](( )jc  )(C  )|(  *  i(C  )j(  ]|(  ]|[  *  *  *  )J(  ]{( % )(( *  S^l  )|( )(( )|(  l|(  )j{  >K  lj{  5^C  )|(  JfC  )^£  )^t  )(( )(t  )j(  ]f(  )K  ))C  )^C 

* 

% 

* 

*  DttTA  EXCHANGING  PROCEDURE 

* 

* 

* 

*  THE  MAIN  MICROPROGRAM  PASSES  = 

*  ADDRESS  OF  DATA< INDEX)  IN  A  REG. 

*  ADDRESS  OF  DATA(BINU*)  IN  B  REG. 

* 

*  LET  INDEX  =  T. 

*  LET  BINV*  =  J 

* 


* 

* 

* 

SUI  P  A  PUT  ADDRESS  DATA  (DIN  P 


READ 

INC 

PNM 

P 

START 

A  READ , P  ~  > M , P  =P  •♦•  I 

SI 

TAB 

STORE 

REAL 

DATA ( I )  WORD! 

IN 

Si 

READ 

INC 

PNM 

P 

START 

READ 

,  P  -  >  M ,  P  -P  +  i 

S2 

TAB 

STORE 

REAL 

DATA ( I )  WORDS 

IN 

sa 

READ 

INC 

PNM 

P 

START 

READ 

P ™>M,P=P«1 

S3 

TAB 

STORE 

IMAG 

DATA(I)  WORD 3 

IN 

S3 

READ 

INC 

PNM 

P 

START 

READ 

P->M,P=P  +  1 

S4 

TAB 

STORE 

IMAG 

DAT  A  < I )  W0RD4 

IN 

S4 
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*  *■ 


* 


P  B  PUT  DATA(J)  ADDRESS  IN  P-REG 


READ 

INC 

PNM 

P 

READ  DATAC J ) WORD! , P- >M , 

P=P  +  1 

SS 

TAB 

STORE 

DAT  A ( J )  WORD!  IN 

SS 

READ 

INC 

PNM 

P 

START 

READ  P->M,  P=P  +  i 

S6 

TAB 

STORE 

DAT  A ( J )  WORDS  IN 

S6 

READ 

INC 

PNM 

P 

START 

READ,P->M,  P=P+ 1 

S7 

TAB 

STORE 

DAT  A ( J )  W0RD3  IN 

S7 

READ 

M 

P 

START 

READ , P- >M ,  P=P  +  1. 

SO 

TAB 

STORE 

DATA<  J )  W0RD4  IN 

SO 

* 

* 

* 

♦  NOW  PERFORM  SWITCH  ON  DAT  A ( I )  AND  DAT  A ( J ) 

* 

♦ 

* 


DEC 

PNM 

P 

L  AST  F 

>-->M,  P=P-i 

WRTE 

r 

S4 

WRITE 

IMAG  DATA(I)  WORD4 

DEC 

PNM 

P 

START 

WRITE  P  --  >  M ,  P  =  P  -  i 

WRTE 

r 

S3 

WRITE 

IMAG  DAT  A ( I >  W0RD3 

DEC 

PNM 

P 

START 

WRITE,  P-->M,P=P--i 

WRTE 

T 

SS 

STORE 

REAL  DATA ( I )  WORDS 

DEC 

PNM 

P 

START 

WRITE  P-  >  M ,  P=P--1. 

WRTE 

T 

Si 

WRITE 

REAL  DATA ( I )  WORD! 

♦NOW  SWITCH  DATA(J)  WORDS 

♦  BUT 

♦FIRST  RESTORE  PROPER  STARTING  ADDRESS 

♦ 

P  A  ADDRESS  OF  DATA(I)->P 

♦ 

♦ 


INC 

PNM 

P 

START 

WRITE 

WRTE 

T 

SS 

STORE 

REAL  DATA( J) 

WORD 

.1. 

INC 

PNM 

P 

START 

WRITE 

WRTE 

T 

S6 

STORE 

REAL  DATA(J) 

WORD 

o 

r... 

INC 

PNM 

P 

START 

WRITE 

WRTE 

T 

S7 

STORE 

IMAG  DATA(J) 

WORD 

3 

INC 

PNM 

P 

START  WRITE 

WRTE 

T 

SO 

STORE 

IMAG  DATA(J) 

WORD 

4 

♦ 

♦ 

♦ 


♦ 

♦ 

♦ 

♦ 

♦ 

JMP 

END 


END  OF  EXCHANGE  ROUTINE 
CONTINUE  WITH  NEXT  INDEX 


CONTINUE 


A SMB , L , C 

* 

* 

*  THIS  ASSEMBLY  PROGRAM  PASSES 

*  THE  PRAMATERS  N,  NN,  AND  DATA 

*  TO  A  MICROCODED  BIT  REVERSAL 

*  ROUTINE.  THE  DATA  ARRAY  CONTAINS 

*  COMPLEX  FLOATING  POINT  NUMBERS 

* 


THE 
IS  : 


CALLING  ROUTINE  FROM  FORTRAN 


CALL  BINV<N,  NN ,  DATA) 


NAM 

BINV, 7 

* 

ENT 

BINV 

EXT 

.  ENTR 

N 

BSS 

i 

NN 

BSS 

i 

DATA 

BSS 

1 

BINV 

NOP 

JSB 

ENTR 

DEF 

N 

DEF 

NN 

* 

* 

* 

L.DB 

N  >  I 

IDA 

NN ,  I 

L.DX 

DATA 

* 

# 

* 

EXE 

CUTE  THE 

* 

* 

MICROCODE  S 

* 

WCS 

ADDRESS 

N  value: 

NN  VALUE 
DATA  STARTING 


THE  MICROCODE 

STARTS  AT 
6  0  0  SB 


^  /p  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ®  ^  ®  *  *  *  *  *p  *  ^  *  *  * 

OCT  1.0  56  OS 

JMP  BINV , I 
END 
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APPENDIX  K 


Theoretical  Calculation  of  Microprogrammed 
Bit  Reversal  Algorithm  (BINV)  Execution  Time 

Micrprograms  tend  to  be  short  programs.  The  microprogrammed .bit 
reversal  programmed  for  this  study  was  70  statements  long.  The  21MX 
basic  cycle  time  to  execute  one  microinstruction  is  325  nsec.  long. 

It  is  therefore  simple  to  calculate  the  run  time  for  the  microprogram 
developed  by  noting  the  number  of  statements  and  the  number  of  times 
they  are  executed.  This  is  done  as  follows: 

1.  To  form  the  bit  reversed  number  required  8  microinstructions. 

N 

This  code  loops  through  N(2  )  times.  Therefore  we  have 

Time=  2*(N)  x  8  x  325  x  10~9  =  2.6  x  10~6  N(2N)  sec.  (4) 

2.  To  set  up  and  initialize  registers  required  4  microinstruc¬ 

tions  which  execute  only  once.  Therefore  the  time  for  these  microin¬ 
structions  is:  ' 

Time  -  3  x  325  x  10_9  =  1.3  x  lO-6  sec.  (5) 

3.  To  form  the  data  exchange  addresses  required  8  microinstruc¬ 
tions.  This  code  loops  through  NN  times,  where  NN  is  the  number  of  times 
the  bit  reversed  index  is  greater  than  the  number  that  was  bit  reversed. 
Thus  the  time  is: 

Time  -  8  x  325  x  10-9  NN  =  2.6  x  10-6  NN  sec.  (6) 

4.  To  perform  the  complex  data  oxchnage  required  37  microinstruc¬ 
tions.  These  microinstructions  are  executed  NN  times.  Thus  the  time 
required  to  execute  these  instructions  is: 

Time  -  37  x  325  x  10-9  NN  =*  12.025  x  10~6  NN  sec.  (7) 
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5. 


To  perform  Che  bit  reversal  compare  required  3  microinstruc- 

N 

tions.  These  statements  execute  2l  times.  Hence  the  time  required  to 
execute  this  code  is: 

Time  =*  3  x  325  x  10-9  x  2N  -  9.75  x  ICf7  x  2N  (8) 

6.  To  form  the  next  index  number  required  8  microinstructions. 

N 

They  are  executed  2  times.  Therefore  the  time  to  execute  these  in¬ 
structions  is: 

Time  =  8  x  325  x  10~9  x  2N  =  2.6  x  10“6  x  2N  (9) 

7.  To  reset  the  P  and  M  registers  and  return  to  the  calling  pro¬ 
gram  required  2  microinstructions.  These  instructions  execute  only  once, 
after  the  routine  is  done.  The  time  is: 

Time  =  8  x  325  x  10-9  =  0.65  x  10~6  sec.  (10) 

Combining  the  above  executions  times  gives  the  total  run  time  for  the 
microcoded  routine.  Hence  the  run  time  is  given  by: 

Run  Tine  =  (2.6N  +  3.575)2N  +  14.6NN  +  1.95  microseconds  (11) 
Using  Eq  11  the  run  time  for  the' microcoded  bit  reversal  sorting 
routine  can  be  theoretically  calculated.  The  run  times  for  various  num¬ 
ber  of  points  of  the  FFT  are  shown  below: 

N 

N _ 2 _ NN _ Run  Time  (microseconds) 


2 

4 

1 

51.4 

3 

8 

2 

121 

4 

16 

6 

311 

5 

32 

12 

704 

6 

64 

28 

1630 

7 

128 

56 

3590 

8 

256 

120 

7994 

) 

)  t  J 

’  » 

17  31/ 
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